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Preface 


Information for readers 


What is this book about and who is it for? 


This is a book on classical mechanics for university undergraduates. It aims to cover 
all the material normally taught in classical mechanics courses from Newton’s laws to 
Hamilton’s equations. If you are attending such a course, you will be unlucky not to 
find the course material in this book. 


What prerequisites are needed to read this book? 


It is expected that the reader will have attended an elementary calculus course and an 
elementary course on differential equations (ODEs). A previous course in mechanics 
is helpful but not essential. This book is self-contained in the sense that it starts from 
the beginning and assumes no prior knowledge of mechanics. However, in a general 
text such as this, the early material is presented at a brisker pace than in books that are 
specifically aimed at the beginner. 


What is the style of the book? 


The book is written in a crisp, no nonsense style; in short, there is no waffle! The object 
is to get the reader to the important points as quickly and easily as possible, consistent 
with good understanding. 


Are there plenty of examples with full solutions? 


Yes there are. Every new concept and technique is reinforced by fully worked exam- 
ples. The author’s advice is that the reader should think how he or she would do each 
worked example before reading the solution; much more will be learned this way! 


Are there plenty of problems with answers? 


Yes there are. At the end of each chapter there is a large collection of problems. For 
convenience, these are arranged by topic and trickier problems are marked with a star. 
Answers are provided to all of the problems. A feature of the book is the inclusion 
of computer assisted problems. These are interesting physical problems that cannot be 
solved analytically, but can be solved easily with computer assistance. 


Where can | find more information? 


More information about this book can be found on the book’s homepage 
http://www.cambridge.org/Gregory 


All feedback from readers is welcomed. Please e-mail your comments, corrections and 
good ideas by clicking on the comments button on the book’s homepage. 


xii Preface 


Information for lecturers 


Scope of the book and prerequisites 


This book aims to cover all the material normally taught in undergraduate mechanics 
courses from Newton’s laws to Hamilton’s equations. It assumes that the students have 
attended an elementary calculus course and an elementary course on ODEs, but no more. 
The book is self contained and, in principle, it is not essential that the students should 
have studied mechanics before. However, their lives will be made easier if they have! 


Inspection copy and Solutions Manual 


Any lecturer who is giving an undergraduate course on classical mechanics can request 
an inspection copy of this book. Simply go to the book’s homepage 


http://www.cambridge.org/Gregory 


and follow the links. 

Lecturers who adopt this book for their course may receive the Solutions Manual. 
This has a complete set of detailed solutions to the problems at the end of the chapters. 
To obtain the Solutions Manual, just send an e-mail giving your name, affiliation, and 
details of the course to solutions@cambridge.org 


Feedback 


All feedback from instructors and lecturers is welcomed. Please e-mail your comments 
via the link on the book’s homepage 


Acknowledgements 


I am very grateful to many friends and colleagues for their helpful comments and sug- 
gestions while this book was in preparation. But most of all I thank my wife Win for 
her unstinting support and encouragement, without which the book could not have been 
written at all. 


Part One 


NEWTONIAN MECHANICS 
OF A SINGLE PARTICLE 


CHAPTERS IN PART ONE 


Chapter 1 The algebra and calculus of vectors 

Chapter 2 Velocity, acceleration and scalar angular velocity 
Chapter 3 Newton’s laws and gravitation 

Chapter 4 Problems in particle dynamics 

Chapter 5 Linear oscillations and normal modes 

Chapter6 [Energy conservation 

Chapter 7 Orbits in a central field 


Chapter 8 Non-linear oscillations and phase space 


Chapter One 
a 


The algebra and calculus 
of vectors 


KEY FEATURES 
The key features of this chapter are the rules of vector algebra and differentiation of vector 
functions of a scalar variable. 


This chapter begins with a review of the rules and applications of vector algebra. 
Almost every student taking a mechanics course will already have attended a course on 
vector algebra, and so, instead of covering the subject in full detail, we present, for easy 
reference, a summary of vector operations and their important properties, together with a 
selection of worked examples. 

The chapter closes with an account of the differentiation of vector functions of a 
scalar variable. Unlike the vector algebra sections, this is treated in full detail. Applica- 
tions include the tangent vector and normal vector to a curve. These will be needed in 
the next chapter in order to interpret the velocity and acceleration vectors. 


1.1. VECTORS AND VECTOR QUANTITIES 


Most physical quantities can be classified as being scalar quantities or vector 
quantities. The temperature in a room is an example of a scalar quantity. It is so called 
because its value is a scalar, which, in the present context, means a real number. Other 
examples of scalar quantities are the volume of a can, the density of iron, and the pressure 
of air in a tyre. Vector quantities are defined as follows: 


Definition 1.1 Vector quantity If a quantity Q has a magnitude and a direction asso- 
ciated with it, then Q is said to be a vector quantity. [Here, magnitude means a positive 
real number and direction is specified relative to some underlying reference frame* that 
we regard as fixed.] 


The displacement of a particle’ is an example of a vector quantity. Suppose the 
particle starts from the point A and, after moving in a general manner, ends up at the 


* See section 2.2 for an explanation of the term ‘reference frame’. 
t A particle is an idealised body that occupies only a single point of space. 
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a 


FIGURE 1.1 Four different representations of each of the 
vectors a, b c form the twelve edges of the parallelopiped 
box. 


point B. The magnitude of the displacement is the distance AB and the direction of the 
displacement is the direction of the straight line joining A to B (in that order). Another 
example is the force applied to a body by a rope. In this case, the magnitude is the strength 
of the force (a real positive quantity) and the direction is the direction of the rope (away 
from the body). Other examples of vector quantities are the velocity of a body and the 
value of the electric (or magnetic) field. In order to manipulate all such quantities without 
regard to their physical origin, we introduce the concept of a vector as an abstract quantity. 


Definition 1.2 Vector A vector is an abstract quantity characterised by the two proper- 
ties magnitude and direction. Thus two vectors are equal if they have the same magnitude 
and the same direction.* 


Notation. Vectors are written in bold type, for example a, b, r or F. The magnitude of 
the vector a, which is a real positive number, is written | a|, or sometimes* simply a. 


It is convenient to define operations involving abstract vectors by reference to some 
simple, easily visualised vector quantity. The standard choice is the set of directed line 
segments. Each straight line joining two points (P and Q say, in that order) is a vector 


quantity, where the magnitude is the distance PQ and the direction is the direction of 
— 


Q relative to P. We call this the line segment PQ and we say that it represents some 
abstract vector a.* Note that each vector a is represented by infinitely many different line 
segments, as indicated in Figure 1.1. 


* In order that our set of vectors should have a standard algebra, we also include a special vector whose 
magnitude is zero and whose direction is not defined. This is called the zero vector and written 0. The 
zero vector is not the same thing as the number zero! 

T It is often useful to denote the magnitudes of the vectors a, b, c, ...by a, b, c, ..., but this does risk 
confusion. Take care! 

t The zero vector is represented by line segments whose end point and starting point are coincident. 
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R 
4 —3b 
a+b Q . > ~b 
a 2b 
P a—b 


FIGURE 1.2 Addition, subtraction and scalar multiplication of vectors. 


1.2 LINEAR OPERATIONS: a + b AND Aa 


Since vectors are abstract quantities, we can define sums and products of vectors 
in any way we like. However, in order to be of any use, the definitions must create some 
coherent algebra and represent something of interest when applied to a range of vector 
quantities. Also, our definitions must be independent of the particular representations 
used to construct them. The definitions that follow satisfy all these requirements. 


The vector sum a+ b 


Definition 1.3 Sum of vectors Let a and bt be any two vectors. Take any representa- 
tion PC Q of a and suppose the line segment OR repre esents b. Then the sum a-+ b of a 
and b is the vector represented by the line segment PR, as shown in Figure 1.2 (left). 


Laws of algebra for the vector sum 


@ b+a=a+b (commutative law) 


Gi) a+ (b+c)=(a+b+e (associative law) 


Definition 1.4 Negative of avector Let b be any vector. Then the vector with the same 
magnitude as b and the opposite direction is called the negative of b and is written —b. 
Subtraction by b is then defined by 


a—b=a+4+(-b). 


[That is, to subtract b just add —b, as shown in Figure 1.2 (centre).] 


The scalar multiple Aa 


Definition 1.5 Scalar multiple Let a be a vector and x be a scalar (a real number). 
Then the scalar multiple Xa is the vector whose magnitude is |i||a| and whose direction 
is 
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(i) the same as a if x is positive, 
(11) undefined if x is zero (the answer is the zero vector), 
(iii) the same as —a if X is negative. 


It follows that —(Aa) = (—A)a. 


Laws of algebra for the scalar multiple 


(i) A(ua) = Ap)a (associative law) 
(i) A(a+b)=da+drb and A+pja=dAa+ua (distributive laws) 


The effect of the above laws is that linear combinations of vectors can be manipu- 
lated just as if the vectors were symbols representing real or complex numbers. 


Example 1.1 Laws for vector sum and scalar multiple 
Simplify the expression 3(2a — 4b) — 2(2a — b). 


Solution 
On this one occasion we will do the simplification by strict application of the laws. It 
is instructive to decide which laws are being used at each step! 


3(2a — 4b) — 2(2a— b) = 3(24 + (-4)b) ET (-2)(2a a (-1)5) 
= (6a + (—12)b) + ((-4)a +25) 


= (6a + (4a) + ((-12)6 + 2) 
=2a+(—10)b=2a—10b.m 


Unit vectors 


A vector of unit magnitude is called a unit vector. If any vector a is divided by its own 
magnitude, the result is a unit vector having the same direction as a. This new vector is 
denoted by @ so that 


G@=a/lal. 


Basis sets 


Suppose a and Bb are two non-zero vectors, with the direction of b neither the same nor 
— 


—> 
opposite to that of a. Let OA, OB be representations of a, b and let P be the plane 
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O a A AQ 


FIGURE 1.3 The set {a, b} is a basis for all vectors lying in the plane O AB. 


containing the triangle OAB. Then (see Figure 1.3) any vector v whose representation 
—> 


OV lies in the plane P can be written in the form 
v=ha+ wb, (1.1) 


where the coefficients A, jz are unique. Vectors that have their directions parallel to the 
same plane are said to be coplanar. Thus we have shown that any vector coplanar with a 
and b can be expanded uniquely in the form (1.1). It is also apparent that this expansion 
set cannot be reduced in number (in this case to a single vector). For these reasons the pair 
of vectors {a, b} is said to be a basis set for vectors lying* in the plane P. 

Suppose now that {a, b, c} is a set of three non-coplanar vectors. Then any vector v, 
without restriction, can be written in the form 


v=dAa+pub+ve, (1.2) 


where the coefficients i, 41, v are unique. In this case we say that the set {a, b, c} is a basis 
set for all three-dimensional vectors. Although any set of three non-coplanar vectors forms 
a basis, it is most convenient to take the basis vectors to be orthogonal unit vectors. In this 
case the basis set’ is usually denoted by {i, j, k} and is said to be an orthonormal basis. 
The representation of a general vector v in the form 


v=Ai+uj+vk 


is common in problem solving. 

In applications involving the cross product of vectors, the distinction between right- 
and left-handed basis sets actually matters. There is no experiment in classical mechanics 
or eletromagnetism that can distinguish between right- and left-handed sets. The differ- 
ence can only be exhibited by a model or some familiar object that exhibits ‘handedness’, 
such as a corkscrew.* Figure 1.4 shows a right-handed orthonormal basis set attached 
to a well known object. 


* Strictly speaking vectors are abstract quantities that do not /ie anywhere. This phrase should be taken to 
mean ‘vectors whose directions are parallel to the plane P’. 


¥ Tt should be remembered that there are infinitely many basis sets made up of orthogonal unit vectors. 
_—>-l Oe > 
+ Suppose that the non-coplanar vectors {a, b, c} have representations OA, OB, OC respectively. Place 


an ordinary corkscrew with the screw lying along the line through O perpendicular to the plane OAB, 


8 Chapter 1 The algebra and calculus of vectors 


FIGURE 1.4 A standard basis set {i, j, k} is 
both orthonormal and right-handed. 


FIGURE 1.5 The points A, B, C have position O 
vectors a, b, c relative to the origin O. 


b B 
Definition 1.6 Standard basis set If an orthonormal basis {i, j,k} is also right- 
handed (as shown in Figure 1.4), we will call it a standard basis. 


Position vectors and vector geometry 


Suppose that O is a fixed point of space. Then relative to the origin O (and relative to the 


underlying reference frame), any point of space, such as A, has an associated line segment, 
— 


OA, which represents some vector a. Conversely, the vector a is sufficient to specify the 
position of the point A. 


Definition 1.7 Position vector The vector a is called the position vector of the point 
A relative to the origin O, [It is standard practice, and very convenient, to denote the 
position vectors of the points A, B,C,...by a, b, ce, and so on, as shown in Figure 1.5.] 


Since vectors can be used to specify the positions of points in space, we can now use 
the laws of vector algebra to prove* results in Euclidean geometry. This is not just an aca- 
demic exercise. Familiarity with geometrical concepts is an important part of mechanics. 
We begin with the following useful result: 


and the handle parallel to OA. Now turn the corkscrew until the handle is parallel to OB and note the 
direction in which the corkscrew would move if it were ‘in action’. ( The direction of the turn must be 
such that the angle turned through is at most 180°.) If OC makes an acute angle with this direction, the 
set {a, b, c} (in that order) is right-handed; if OC makes an obtuse angle with this direction then the set 


is left-handed. 
* Some properties of Euclidean geometry have been used to prove the laws of vector algebra. However, 
this does not prevent us from giving valid proofs of other results. 
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FIGURE 1.6 The point X divides the line AB 
in the ratio A: jp. O 


Example 1.2 Point dividing a line in a given ratio 


The points A and B have position vectors a and 5 relative to an origin O. Find the 
position vector x of the point X that divides the line AB in the ratio 4 : yu (that is 
AX/XB=A/p). 


Solution 
It follows from Figure 1.6 that x is given by* 


se x — 
rat Ak=a+ (=) AB 
A+u 


x a+b 
a+ b-—a)= : 
(a) A+ UL 


In particular, the mid-point of the line AB has position vector 5(a + 5b). a 


Example 1.3 Centroid of a triangle 


Show that the three medians of any triangle meet in a point (the centroid) which 
divides each of them in the ratio 2:1. 


Solution 

Let the triangle be ABC where the points A, B, C have position vectors a, b, c 
relative to some origin O. Then the mid-point P of the side BC has position vector 
p= 5(b +c). The point X that divides the median AP in the ratio 2:1 therefore has 
position vector 


a+2p a+bte 
an a 


The position vectors of the corresponding points on the other two medians can be 
found by cyclic permutation of the vectors a, b, c and clearly give the same value. 
Hence all three points are coincident and so the three medians meet there. Hl 


— 
* Strictly speaking we should not write expressions like a+ AX since the sum we defined was the sum 


of two vectors, not a vector and a line segment. What we really mean is ‘the sum of a and the vector 
— 
represented by the line segment AX’. Pure mathematicians would not approve but this notation is so 


convenient we will use it anyway. It’s all part of living dangerously! 


10 
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FIGURE 1.7 The bisector theorem: 
AP/PB = OA/OB. A P B 


1.3 


Example 1.4 The bisector theorem 


In a triangle O AB, the bisector of the angle AOB meets the line AB at the point P. 
Show that AP/PB = OA/OB. 


Solution 

Let the vertex O be the origin of vectors* and let the position vectors of the vertices 
A, B relative to O be a, b as shown in Figure 1.7. The point with position vector 
a+ b does not lie the bisector O P in general since the vectors a and b have different 
magnitudes a and b. However, by symmetry, the point with position vector G+ does 
lie on the bisector and a general point X on the bisector has a position vector x of the 
form 


ie a b ba+ab ba+ab 
saa@rDaa(ts)=("5—)= (FE) 
where K = ab/d is a new constant. Now X will lie on the line AB if its position 


vector has the form (ja + Ab)/(A + 2), that is, if K = a +b. Hence the position 
vector p of P is 


_ ba+ab 
~ a+b 


Moreover we see that P divides that line AB in the ratio a : b, that is, AP/PB = 
OA/OB as required. @ 


THE SCALAR PRODUCT a-b 


—> 

Definition 1.8 Scalar product Suppose the vectors a and b have representations OA 
—> 

and OB. Then the scalar product a - b of a and b is defined by 


a-b=|a||b|cos@, (1.3) 


where 0 is the angle between O A and OB. [Note that a - b is a scalar quantity. ] 


* One can always take a special point of the figure as origin. The penalty is that the symmetry of the 
labelling is lost. 
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Laws of algebra for the scalar product 


(i) b-a=a-b (commutative law) 
(ii) a-(b+c)=a-b+a-c (distributive law) 
(iii) (Aa) - b= (a- b) (associative with scalar multiplication) 


Properties of the scalar product 


(i) a-a=|al’. 
(ii) The scalar product a - b = 0 if (and only if) a and b are perpendicular (or one of 
them is zero). 
aii) If {i, 7, k} is an orthonormal basis then 


i-i=jr-j=k-k=1, i-j=j-k=k-i=0. 
(iv) Ifa, =Ayit mys + vy, k and az = Adi + aj + v2k then 
4, +42 =AjAQ + Mhz + V1V2. 


Example 1.5 Numerical example on the scalar product 


Ifa = 2i — j + 2k and b = 4i — 3k, find the magnitudes of a and b and the angle 
between them. 


Solution 

|a|? =a-a = (2i—j+2k)-(2i-j+2k) =2?+(-1)* +2? = 9. Hence |a| = 3. 

Similarly | b|? = 47+0*+(—3)? = 25 so that | b| = 5. Also a-b = 8+-0+(—6) = 2. 

Since a- b = |a||b| cos 90, it follows that 2 = 3 x 5 x cos@ so that cos@ = 2/15. 
Hence the magnitudes of a and b are 3 and 5, and the angle between them is 

cos~!(2/15). ml 


Example 1.6 Apollonius’s theorem 


In the triangle OAB, M is the mid-point of AB. Show that (OA)? + (OB)? = 
2(OM)* + 2(AM)?. 


Solution 
Let the vertex O be the origin of vectors and let the position vectors of A and B be a 
and b. Then the position vector of M is x(a + b). Then 


4(OM)? = |a+b|* = (a+b)- (a+b) 
=a-a+b-b+2a-b=|al?+|b\>+2a-b 
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FIGURE 1.8 The component of v in the 
direction of the unit vector n is equal to OV’, 
the projection of OV onto the line through 
O parallel to n. 


and 


4(AM)* = (AB)* = |a— b|* = (a— b)- (a— BD) 
=a-at+b-b—2a-b=|al|? +|b|* —2a-b. 


Hence 
2(OM)* + 2(AM)? = |a|? + |b|? = (OA)? + (OBY* 


as required. i 


Components of a vector 


Definition 1.9 Components of a vector Let n be a unit vector. Then the component 
of the vector v in the direction of n is defined to be v +n. The component of v in the 
direction of a general vector a is therefore v +@. 


Properties of components 


—> 
(1) The component v - 7 has a simple geometrical significance. Let OV be a represen- 
tation of v as shown in Figure 1.8. Then 


v-n=|v||n| cos? = OV cosé = OV’, 


where OV’ is the projection of OV onto the line through O parallel to n. 
(11) Suppose that v is a sum of vectors, v = v; + v2 + v3 say. Then the component of 
v in the direction of n is 


vV-n= (v1 + 02+ 03)- n= (Vi -N)+(V2:Nn)+(03°N), 


by the distributive law for the scalar product. Thus, the component of the sum of a 
number of vectors in a given direction is equal to the sum of the components of the 
individual vectors in that direction. 

(iii) If a vector v is expanded in terms of a general basis set {a, b, c} in the form v = 
Aa+ub-+Vve, the coefficients A, 4, v are not the components of the vector v in the 
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axb 


FIGURE 1.9 The vector product 


A 
ax b = (\a||b| sind) n. O 


directions of a, b, c. However if v is expanded in terms of an orthonormal basis set 
{i, j, k} inthe form v = Ai+yj+vk, then the component of v in the i-direction 
is 


v-f=(Aitpstvk) -i=1G-D+pG-)+v(k-d 
=—AA0Ga a, 


Similarly jz and v are the components of v in the j- and k-directions. Hence when 
a vector v is expanded in terms of an orthonormal basis set {i, j, k} in the form 
v=di+pujt+vk, the coefficients X, u,v are the components of v in the i- j- 
and k-directions. 


Example 1.7 Numerical example on components 


Ifv = 6i—37+ 15k anda = 2i — j —2k, find the component of v in the direction 
of a. 


Solution 
|a|? =a-a=2* + (-1)? 4+ (—2)* = 9. Hence | a| = 3 and 


a 2i-j-2k 
ja| 3 : 


a= 


The required component of v is therefore 


v-@= (61-3) +154) ( 


2i-f—2k\ _ 1243-30. 
3 3 


1.4 THE VECTOR PRODUCT axb 


> 
Definition 1.10 Vector product Suppose the vectors a and b have representations OA 


—> 
and OB and let n be the unit vector perpendicular to the plane OAB and such that 
{a, b, n} is a right-handed set. Then the vector product ax b of a and b is defined by 


axb= (\a\||b|sin@)n, (1.4) 
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where 0 (0 < @ < 180°) is the angle between OA and OB. [Note that a x b is a vector 
quantity. | 


Laws of algebra for the vector product 


(i) bxa=-axb (anti-commutative law) 
(ii) ax(b+c)=axb+axc (distributive law) 
(iii) (Aa) x b = X(axb) (associative with scalar multiplication) 


Since the vector product is anti-commutative, the order of the terms in vector prod- 
ucts must be preserved. The vector product is not associative. 


Properties of the vector product 


(i) axa=0. 
(ii) The vector product ax b = 0 if (and only if) a and b are parallel (or one of them is 
Zero). 
aii) If {i, 7, k} is a standard basis then 


ixj=k, kxi=j, jxk=i, ixi=jxj=kxk=0. 
(iv) Ifa, =Ayit my + vk and az = Adi + oj + v2k then 
ijk 
a, Xa2=}A; 1 VY 
A2 M2 v2 


where the determinant is to be evaluated by the first row. 


Example 1.8 Numerical example on vector product 


Ifa = 2i — j+2k and b= —i — 3k, find a unit vector perpendicular to both a and 
b. 


Solution 
The vector a x b is perpendicular to both a and b. Now 


ij k 
axb=| 2-1 2 
—1 0-3 

= (3 —0)i — ((-6) — (-2)) 7 + O-Dk 
=31+4j-k. 


The magnitude of this vector is (Ge cdl ts (—1)?) UP (26)!/2. Hence the required 
unit vector can be either of + (3i +4 j — k) /(26)'/*. m 
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1.5 TRIPLE PRODUCTS 


Triple products are not new operations but are simply one product followed by 
another. There are two kinds of triple product whose values are scalar and vector respec- 
tively. 


Triple scalar product 


An expression of the form a - (bx c) is called a triple scalar product; its value is a scalar. 


Properties of the triple scalar product 
(i) 
a-(bxc)=c-(axb) =b: (exa), (1.5) 


that is, cyclic permutation of the vectors a, b, c in a triple scalar product leaves its 
value unchanged. [Interchanging two vectors reverses the sign. ] 
This formula can alternatively be written 


a-(bxc) = (axb)-c, (1.6) 


that is, interchanging the positions of the ‘dot’ and the ‘cross’ in a triple scalar 
product leaves its value unchanged. 

Because of this symmetry, the triple scalar product can be denoted unambiguously 
by La, b,c]. 

(ii) The triple scalar product [a, b, c] = 0 if (and only if) a, b, c are coplanar (or one 
of them is zero). In particular a triple scalar product is zero if two of its vectors are 
the same. 

aii) If [a, b, c] > 0 then the set {a, b, c} is right-handed. If [a, b, c] < 0 then the set 
{a, b, c} is left-handed. 


(iv) Ifay =Ayit mrs + vik, a2 = A2i + way + v2k, a3 = A3i + 37 + 13k, where 
{i, j, k} is a standard basis, then 


At 1 V4 
a1, a2, 43] = | A2 M2 V2}. Cl.7) 
A3 3 V3 


Triple vector product 


An expression of the form ax(bxc) is called a triple vector product; its value is a vector. 


Property of the triple vector product 
Since bx c is perpendicular to both b and c, it follows that a x (b x c) must lie in the 
same plane as b and c. It can therefore be expanded in the form Aa + wb. The actual 
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formula is 
ax(bxc) = (a-c)b—(a-b)c. (1.8) 


Since the vector product is anti-commutative and non-associative, it is wise to use this 
formula exactly as it stands. 


Example 1.9 Using triple products 


Expand the expression (a x b) - (ex d) in terms of scalar products. 


Solution 
Use the triple scalar product formula (1.6) to interchange the first ‘dot’ and ‘cross’, 
and then expand the resulting triple vector product by the formula (1.8), as follows: 


(ax b)-(cxd) =a-[bx(cxd)|=a-[(b-d)c— (b-c)d| 
=(a-c)(b- d)—(a-d)(b-c)@ 


1.6 VECTOR FUNCTIONS OF A SCALAR VARIABLE 


In practice, the value of a vector quantity often depends on a scalar variable such 
as the time f. For example, if A is the label of a particle moving through space, then its 
position vector a (relative to a fixed origin O) will vary with time, that is, a = a(t). The 
vector a is therefore a function of the scalar variable f. 

The time dependence of a vector need not involve motion. The value of the electric 
or magnetic field at a fixed point* of space will generally vary with time so that E = E(t) 
and B = B(t). More generally, the scalar variable need not be the time. Consider the 
space curve C shown in Figure 1.10, whose points are parametrised by the parameter a. 
Each point of the curve has a unique tangent line whose direction can be characterised by 
the unit vector ¢. This is called the unit tangent vector to C and it depends on aq, that is, 
t = ¢(q@). In this case the independent variable is the scalar @ and (just to confuse matters) 
the dependent variable is the vector ¢. 


Differentiation 


The most important operation that can be carried out on a vector function of a scalar 
variable is differentiation. 


Definition 1.11 Differentiation of vectors Suppose that the vector v is a function of 
the scalar variable a, that is, v = v(a). Then the derivative of the function v(a) with 
respect to a is defined by the limit' 


dv . v(a + Aq) — v(a) 
—= lim : 
da Aa—>0 Aa 


(1.9) 


* ‘We will not be concerned here with vector functions of position. These are called vector fields. 
+ Mathematical note: The statement u(a) > U as a > A means that ju(a) -U| > Oasa—> A. 
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This looks identical to the definition of the derivative of an ordinary real function, but there 
is a difference. When a changes to a+ Aq, the function v changes from v(q@) to v(a+Aq), 
a difference of v(~a+ Aa)—v(a). However, this ‘difference’ now means vector subtraction 
and its value is a vector; it remains a vector after dividing by the scalar increment Aq. 
Hence dv/da, the limit of this quotient as Aw — 0, is a vector. Furthermore, since 
dvu/da depends on a, it is itself a vector function of the scalar variable aw. The rules for 
differentiating combinations of vector functions are similar to those for ordinary scalar 
functions. 


Differentiation rules for vector functions 


Let u(a) and v(a) be vector functions of the scalar variable a, and let A(@) be a 
scalar function. Then: 


a 


(Au) =Au+aAu 
da 


Gene eee (ii) 
da 

ie: 2 : : ee: : : 

Gil) —(u-v) =u-v+u-v (iv) —(uxv)=Uuxv+uxd 
da da 


where u means du/da and so on. Note that the order of the terms in the vector 
product formula must be preserved. 


Example 1.10 Differentiating vector functions 


(i) The position vector of a particle P at time ¢ is given by 
r= QP =5)i+ 442 740° k, 


where {i, j, k} is a constant basis set. Find dr/dt and d*r/dt?. (These are the 
velocity and acceleration vectors of P at time f.) 


(iii) If a = a(t) and bis a constant vector, show that 
= (ax b)] (ax b) 
—l[a-(a =a-(a . 
dt 
Solution 
(i) Since i, j, k are constant vectors, it follows from the differentiation rules that 
2 


d-r : 


dr _ 


= (4 —5)i+4j4 377k, 
[ ( yi+4gy+ 
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_ chord joining A and A 


tangent line at A 


FIGURE 1.10 The unit tangent vector ¢(@) at a typical point A on the curve C, 
defined parametrically by r = r(a). 


(ii) 


a ee ee ee 
ql’ (ax )] =a-(ax ae ( Zax )) = +a- (ax + axb) 


=a-(axb+ax0) =a- (axb), 


as required. i 


1.7 TANGENT AND NORMAL VECTORS TO A CURVE 


In the next chapter we will define the velocity and acceleration of a particle mov- 
ing in a space of three dimensions. In order to be able to interpret these definitions, we 
need to know a little about the differential geometry of curves. In particular, it is useful to 
know what the unit tangent and unit normal vectors of a curve are. 


Unit tangent vector 


Consider the curve C shown in Figure 1.10 which is defined by the parametric equation 

r = r(qa). In general this can be a curve in three-dimensional space. Let A be a typical 

point of C corresponding to the parameter w and A’ a nearby point corresponding to the 
— 


parameter a + Aw. The chord AA’ represents the vector 


Ar=r(a+ Aq) — r(a) 


—>=> 
and so Ar/| Ar| is a unit vector parallel to the chord AA’. The unit tangent vector ¢(q) 
at the point A is defined to be the Jiixit of this expression as A’ > A, that is 


Hah ter 
= im —m—. 
Be he (API 
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20 


> © 
at 2am 


FIGURE 1.11 The cycloid x = a(@ — sin@), y = a(1 — cos), 
z = 0, where 0 < 9 < 27. 


The tangent vector ¢ is related to the derivative dr/da since 


dr ; Ar . Ar . |Ar| 
—= lm —= lim x lim 
da  Aa>0Aa Aa>0|Ar| Aa>0 Aa 


: Ar dr 
= t(a) x | lim —]|=f¢(a) x |—}], 
Aa>0 Aa da 
that is, 
dr dr 
— = |—| ft(a). 1.10 
da da a) ( ) 


Example 1.11 Finding the unit tangent vector 


Figure 1.11 shows the cycloid x = a(@ — sin@), y = a(1 — cos@), z = O, where 
0 < 6 < 2z. Find the unit tangent vector to the cycloid at the point with parameter 
0. 


Solution 


Let i, j be unit vectors in the directions Ox, Oy respectively. Then the vector form 
of the equation for the cycloid is 


r=a(6 —sin9)i+a(1 —cos8@) j. 


Then 
Gs = a(1 —cos@)i + (asin@) j 
dé 
and 
dr 


=a (2—2cos6)'/? = 2a sin 40. 


dO 
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Hence the unit tangent vector to the cycloid is 
dr dr ssi tis ws 
t(0) = 7) 70 = (sin 56)i + (cos 56) J, 


after simplification. 


The formula (1.10) takes its simplest form when the parameter @ is taken to be s, the 
distance along the curve measured from some fixed point. In this case, 


dr 


| Ar| 
ds 


= 1 
As>0 As 


so that ¢ (pointing in the direction of increasing s) is given by the simple formula 


t=—. (1.11) 


This is the most convenient formula for theoretical purposes. 


Unit normal vector 


Let t(s) be the unit tangent vector to the curve C, where the parameter s represents distance 
along the curve. Then, since ¢ is a vector function of the scalar variable s, it has a derivative 
dt/ds which is another vector function of s. 

Since ¢ is a unit vector it follows that t(s)+t(s) = 1 and if we differentiate this identity 
with respect to s, we obtain 


0= sae i t+t a 
~ ds ~ ds ds 


d 
=2(Z 4), 
ds 


a (1.12) 
—=Kkn : 
ds 

where k = |dt/ds|, a positive scalar called the curvature, and n is a unit vector called 


the (principal) unit normal vector. At each point of the curve, the unit vectors ¢(s) and 
n(s) are mutually perpendicular. 

The quantities n and « have a nice geometrical interpretation. Let A be any point on 
the curve and suppose that the distance parameter s is measured from A. Then, by Taylor’s 
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theorem, the form of the curve C near A is given approximately by 


dr 1.2 d*r 3 
ra)=ro+s|7] + 78 o ee ). 


that is, 
r(s) =a+st+ (3x5?) n +0 (o°), (1.13) 


where a is the position vector of the point A, and ¢, « and n are evaluated at the point A. 
Thus, near A, the curve C lies* in the plane through A parallel to the vectors t and n. We 
can also see from equation (1.13) that, near A, the curve C is approximately a parabola. 
To the same order of approximation, it is equally true that, near A, the curve C is given by 


r(s) =at+x« !(sinks)t +«~! (1 —cosxs)n +O (s°) ; (1.14) 


Thus, near A, the curve C is approximately a circle of radius «~!; the vector ¢ is tangential 
to this circle and the vector n points towards its centre. The radius «~! is called the radius 
of curvature of C at the point A. 


Example 1.12 Finding the unit normal vector and curvature 


Find the unit normal vector and curvature of the cycloid x = a(@ — sin@), y = 
a(1 —cos@), z = 0, where 0 < 6 < 27. 


Solution 
The tangent vector to the cycloid has already been found to be 


d d 
t(0) = a/\ = (sin 40)i + (cos 46) j. 
Hence, by the chain rule, 


dt dt/d0  dt/dd (cos 50)i — 4(sin 50) j 
ds ds/d@ \dr/d0| — 2a sin 40 


= (4a sin se) ((cos 19) i — (sin 40) i) , 


Hence the unit normal vector and curvature of the cycloid are given by 
= 
n(0) = (cos 30) i — (sin 36) j, (0) = (40 sin 40) 


The radius of curvature of the cycloid is therefore 4a sin 50. | 


* More precisely, this plane makes three point contact with the curve C at the point A. 
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Problems on Chapter 1 


Answers and comments are at the end of the book. 


Harder problems carry a star (*). 


1.1 In terms of the standard basis set {i, j,k}, a = 2i — j — 2k, b = 31 —4k ande = 
i—Sj+3k. 


(i) Find 3a + 2b — 4c and |a — b|?. 
(ii) Find |a|, | b| and a - b. Deduce the angle between a and b. 
(iii) Find the component of c in the direction of a and in the direction of b. 
(iv) Find ax b, bx c and (ax b) x (bx c). 
(v) Find a- (bx c) and (ax b) - c and verify that they are equal. Is the set {a, b, c} right- 
or left-handed? 
(vi) By evaluating each side, verify the identity ax (bx c) = (a- c)b—(a-b)c. 


Vector geometry 


1.2 Find the angle between any two diagonals of a cube. 


1.3 ABCDEF is aregular hexagon with centre O which is also the origin of position vectors. 
Find the position vectors of the vertices C, D, E, F in terms of the position vectors a, b of A 
and B. 


1.4 Let ABCD be a general (skew) quadrilateral and let P, Q, R, S be the mid-points of the 
sides AB, BC, CD, DA respectively. Show that PQRS is a parallelogram. 


1.5 Ina general tetrahedron, lines are drawn connecting the mid-point of each side with the 
mid-point of the side opposite. Show that these three lines meet in a point that bisects each of 
them. 


1.6 Let ABCD bea general tetrahedron and let P, Q, R, S be the median centres of the faces 
opposite to the vertices A, B, C, D respectively. Show that the lines AP, BQ, CR, DS all 
meet in a point (called the centroid of the tetrahedron), which divides each line in the ratio 3:1. 


1.7 A number of particles with masses m , m2, m3, ... are situated at the points with position 
vectors r,, '2, 3, ... relative to an origin O. The centre of mass G of the particles is defined 
to be the point of space with position vector 


= mr, +mor2+m3r3+--- 
my+m2+m3+-::: 


Show that if a different origin O’ were used, this definition would still place G at the same 
point of space. 


1.8 Prove that the three perpendiculars of a triangle are concurrent. 
[Construct the two perpendiculars from A and B and take their intersection point as O, the origin of position 


vectors. Then prove that OC must be perpendicular to AB.] 
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Vector algebra 


1.9 Ifa) =Ayitmiftvik, a2 = Ait prj +vok, a3 = 131+ “3 j +3k, where {i, J, k} 
is a standard basis, show that 


At M1 VI 
a, + (€2X a3) = |A2 2 V2]. 
A3 M3 V3 


Deduce that cyclic rotation of the vectors in a triple scalar product leaves the value of the 
product unchanged. 


1.10 By expressing the vectors a, b, c in terms of a suitable standard basis, prove the identity 
ax(bxc) = (a-c)b— (a: b)c. 


1.11 Prove the identities 


(i) (ax b)- (cx d) = (a- c)(b+ d) — (a- d)(b- cc) 
(ii) (ax b)x(cexd) =[a,b,d|c—|[a,b,cl|d 
(iii) ax (bx c) + cx (axb)+ bx(cxa)=0 (Jacobi’s identity) 


1.12 Reciprocal basis Let {a, b, c} be any basis set. Then the corresponding reciprocal 
basis {a*, b*, c*} is defined by 


vu bxe e cxa e axb 
~ [aby el ~~ [a,bey ~~ [a,c] 


(i) If {i, 7, k} is a standard basis, show that {i*, j*, k*} = {i, j, k}. 
(ii) Show that [a*, b*, c*] = 1/[a, b, c]. Deduce that if {a, b, c} is aright handed set then 
so is {a*, b*, c*}. 
(iii) Show that {(a*)*, (b*)*, (c*)*] = {a, b, c}. 
(iv) If a vector v is expanded in terms of the basis set {a, b, c} in the form 
v=Aa+pub+ve, 


show that the coefficients A, 4, v are given byA = v-a*,w=v-b*,v=v0-c*. 


1.13 Lamé’s equations The directions in which X-rays are strongly scattered by a crystal are 
determined from the solutions x of Lamé’s equations, namely 


x-a=L, x-b=M, x-c=N, 


where {a, b, c} are the basis vectors of the crystal lattice, and L, M, N are any integers. Show 
that the solutions of Lamé’s equations are 


x=La+Mb+NC,” 


where {a*, b*, c*} is the reciprocal basis to {a, b, c}. 
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Differentiation of vectors 
1.14 If r(t) = Bt? —Dit+e j +(t+3)k, where {i, j, k} is a constant standard basis, find 


r and r. Deduce the time derivative of rx r. 


1.15 The vector v is a function of the time ¢ and k is a constant vector. Find the time deriva- 
tives of (i) | v|, (ii) (v + k) v, (iii) [v, 0, k]. 


1.16 Find the unit tangent vector, the unit normal vector and the curvature of the circle x = 
acos@, y =asin6@, z = Oat the point with parameter 0. 


1.17 Find the unit tangent vector, the unit normal vector and the curvature of the helix x = 
acos@, y =asin0@, z = bé at the point with parameter 0. 


1.18 Find the unit tangent vector, the unit normal vector and the curvature of the parabola 
x = ap’, y = 2ap,z = Oat the point with parameter p. 


Chapter Two 
a | 


Velocity, acceleration 
and scalar angular velocity 


KEY FEATURES 
The key concepts in this chapter are the velocity and acceleration of a particle and the angular 
velocity of a rigid body in planar motion. 


Kinematics is the study of the motion of material bodies without regard to the forces 
that cause their motion. The subject does not seek to answer the question of why bod- 
ies move as they do; that is the province of dynamics. It merely provides a geometrical 
description of the possible motions. The basic building block for bodies in mechanics is 
the particle, an idealised body that occupies only a single point of space. The impor- 
tant kinematical quantities in the motion of a particle are its velocity and acceleration. 
We begin with the simple case of straight line particle motion, where velocity and accel- 
eration are scalars, and then progress to three-dimensional motion, where velocity and 
acceleration are vectors. 

The other important idealisation that we consider is the rigid body, which we regard 
as a collection of particles linked by a light rigid framework. The important kinematical 
quantity in the motion of a rigid body is its angular velocity. In this chapter, we con- 
sider only those rigid body motions that are essentially two-dimensional, so that angular 
velocity is a scalar quantity. The general three-dimensional case is treated in Chapter 16. 


2.1. STRAIGHT LINE MOTION OF A PARTICLE 


Consider a particle P moving along the x-axis so that its displacement x from the 
origin O is a known function of the time t. Then the mean velocity of P over the time 


O P 
—__+—___@—_____» r 


—_— PS) 


FIGURE 2.1 The particle P moves in a straight line and 
has displacement x and velocity v at time f. 
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interval t) < ¢t < fo is defined to be the increase in the displacement of P divided by the 
time taken, that is, 


x(t2) — x(t) 


24 
are (2.1) 


Example 2.1 Mean velocity 


Suppose the displacement of P from O at time t is given by x = t* — 6t, where x 
is measured in metres and ¢ in seconds. Find the mean velocity of P over the time 
interval 1 < ¢ < 3. 


Solution 


In this case, x(1) = —5 and x(3) = —9 so that the mean velocity of P is ((—9) — 
(—5))/(8 -1) =—2ms"!. 


The mean velocity of a particle is less important to us than its instantaneous velocity, 
that is, its velocity at a given instant in time. We cannot find the instantaneous velocity of 
P at time ¢; merely by letting t2 = f; in the formula (2.1), since the quotient would then 
be undefined. However, we can define the instantaneous velocity as the /imit of the mean 
velocity as the time interval tends to zero, that is, as f2 — t,. Thus v(f,), the instantaneous 
velocity of P at time ¢ can be defined by 


v(t) = Jim (ea) ; 


27 to —- ty 
But this is precisely the definition of dx /dt, the derivative of x with respect to t, evaluated 
at t = t,. This leads us to the official definition: 
Definition 2.1 1-D velocity The (instantaneous) velocity v of P, in the positive x- 


direction, is defined by 


_ dx 


v= a (2.2) 


The speed of P is defined to be the rate of increase of the total distance travelled and is 
therefore equal to | v|. 
Similarly, the acceleration of P, the rate of increase of v, is defined as follows: 


Definition 2.2 1-D acceleration The (instantaneous) acceleration a of P,, in the posi- 
tive x-direction, is defined by 


a=—=—,. (2.3) 


Example 2.2 Finding rectilinear velocity and acceleration 


Suppose the displacement of P from O at time ¢ is given by x = t? — 6t? +4, where 
x is measured in metres and ¢ in seconds. Find the velocity and acceleration of P at 
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time t. Deduce that P comes to rest twice and find the position and acceleration of P 


at the later of these two times. 


Solution 
Since v = dx/dt anda = dv/dt, we obtain 


v=3t7—-12r and a=6r—12 


as the velocity and acceleration of P at time f. 
P comes to rest when its velocity v is zero, that is, when 


3¢2 — 12t = 0. 


This is a quadratic equation for ¢ having the solutions ¢ = 0, 4. Thus P is at rest when 
t=Osandt=4s. 

When t = 4s, x = —28 manda = 12 ms~”. Note that merely because v = 0 
at some instant it does not follow that a = 0 also. 


Example 2.3 Reversing the process 


A particle P moves along the x-axis with its acceleration a at time f given by 
a= 12t7-6t+6ms ”. 


Initially P is at the point x = 4 m and is moving with speed 8 ms~! in the negative 
x-direction. Find the velocity and displacement of P at time ¢. 


Solution 


Since a = dvu/dt we have 


and integrating with respect to ¢ gives 
v = 46 — 317 + 6 +C, 


where C is a constant of integration. This constant can be determined by using the 
given initial condition on v, namely, v = —8 whent = 0. This gives C = —8 so that 
the velocity of P at time f is 


v = 403 — 317 + 6r -—8ms_!. 
By writing v = dx /dt and integrating again, we obtain 
(ef =F BoP =5) 2p, 


where D is a second constant of integration. D can now be determined by using the 
given initial condition on x, namely, x = 4 when t = 0. This gives D = 4 so that the 
displacement of P at time f is 


xett—P4+3-84+4m.8 
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FIGURE 2.2 The particle P moves in three-dimensional 
space and, relative to the reference frame F and origin O, 
has position vector r at time f. 


2.2 


GENERAL MOTION OF A PARTICLE 


When a particle P moves in two or three-dimensional space, its position can be 


described by its vector displacement r from an origin O that is fixed in a rigid reference 


frame F. Whether F is moving or not is irrelevant here; the position vector r is simply 


measured relative to F. Figure 2.2 shows a particle P moving in three-dimensional space 


with position vector r (relative to the reference frame F) at time f. 


Question Reference frames 


What is a reference frame and why do we need one? 


Answer 


A rigid reference frame F is essentially a rigid body whose particles can be labelled 
to create reference points. The most familiar such body is the Earth. Relative to 
a single particle, the only thing that can be specified is distance from that particle. 
However, relative to a rigid body, one can specify both distance and direction. Thus 
the value of any vector quantity can be specified relative to F. In particular, if we 
label some particle O of the body as origin, we can specify the position of any point 
of space by its position vector relative to the frame F and the origin O. 

The specification of vectors relative to a reference frame is much simplified if we 
introduce a Cartesian coordinate system. This can be done in infinitely many different 
ways. Imagine that F is extended by a set of three mutually orthogonal planes that are 
rigidly embedded in it. The coordinates x, y, z of a point P are then the distances of P 
from these three planes. Let O be the origin of this coordinate system, and {i, j, k} 
its unit vectors. We can then conveniently refer to the frame F, together with the 
embedded coordinate system Ox yz, by the notation F {O; i, j, k}. @ 


In general motion, the velocity and acceleration of a particle are vector quantities and 


are defined by: 
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Definition 2.3 3-D velocity and acceleration The velocity v and acceleration a of P 
are defined by 


v= — and a=—. (2.4) 


Connection with the rectilinear case 

The scalar velocity and acceleration defined in section 2.1 for the case of straight line motion are simply 
related to the corresponding vector quantities defined above. It would be possible to use the vector for- 
malism in all cases but, for the case of straight line motion along the x-axis, r, v, and a would have the 
form 


where v = dx/dt anda = dv/dt. It is therefore sufficient to work with the scalar quantities x, v and a; 


use of the vector formalism would be clumsy and unnecessary. 


Example 2.4 Finding 3-D velocity and acceleration 


Relative to the reference frame F{O ; i, j, k}, the position vector of a particle P at 
time f¢ is given by 


r= 0-34 GeO 74C 427% 


Find (i) the distance OP when t = 0, (ii) the velocity of P when t = 1, (iii) the 
acceleration of P when t = 2. 


Solution 


In this solution we will make use of the rules for differentiation of sums and products 
involving vector functions of the time. These rules are listed in section 1.6. 


(i) When t = 0, r = —31+4 J so thatOP = |r| =5. 


(ii) Relative to the reference frame F, the unit vectors {i, j, k} are constant and so 
their time derivatives are zero. The velocity v of P is therefore 


v=dr/dt =4ti+4j4+ Gr +40)k. 
Whent = 1l,v=4i+4j+4+7k. 
(iii) Relative to the reference frame F, the acceleration a of P is 
a=dv/dt=4i+ (6t+4)k. 
When t = 2,a=4i+ 16k. 


Interpretation of the vectors v and a 


The velocity vector v has a simple interpretation. Suppose that s is the arc-length travelled 
by P, measured from some fixed point of its path, and that s is increasing with time.* 


* The arguments that follow assume a familiarity with the unit tangent and normal vectors to a general 
curve, as described in section 1.7 
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Then, by the chain rule, 


dr dr ds 

v= a —-_ XK — 

dt ds dt 
=vt 


where ¢ is the unit tangent vector to the path and v (= ds /dt) is the speed* of P. Thus, 
at each instant, the direction of the velocity vector v is along the tangent to its path, and 
|v | is the speed of P. 


The acceleration vector a is harder to picture. This is partly because we are too accus- 
tomed to the special case of straight line motion. However, in general, 


dv d(vt) = dv dt dv dt ds 
a= = =—t+v—= fE+u|— x — 
dt dt dt dt dt ds dt 
dv v? 
—(—J)t —)n, 2.5 
(=) + Pa (2.5) 


where n is the unit normal vector to the path of P and p (= k—') is its radius of curvature. 


Hence, the acceleration vector a has a component dv/dt tangential to the path and a 
component v*/p normal to the path. 

This formula is surprising. Since each small segment of the path is ‘approximately 
straight’ one might be tempted to conclude that only the first term (dv/dt)t should be 
present. However, what we have shown is that the acceleration vector of P does not gen- 
erally point along the path but has a component perpendicular to the local path direction. 
The full meaning of formula (2.5) will become clear when we have treated particle motion 
in polar coordinates. 


Uniform circlular motion 


The simplest example of non-rectilinear motion is motion in a circle. Circular motion 
is important in practical applications such as rotating machinery. Here we consider the 
special case of uniform circular motion, that is, circular motion with constant speed. 

Consider a particle P moving with constant speed u in the anti-clockwise direction 
around a circle centre O and radius b, as shown in Figure 2.3. At time ¢ = 0, P is at the 
point B(b, 0). What are its velocity and acceleration vectors at time f? 

The first step is to find the position vector of P at time t. Since P moves with constant 
speed u, the arc length B P travelled in time t must be ut. It follows that the angle 6 shown 
in Figure 2.3 is given by 6 = ut/b. The position vector of P at time f is therefore 


r=bcos0i+bsiné j, 
= bcos(ut/b)i + bsin(ut /b) j. 


* As in the rectilinear case, speed means the rate of increase of the total distance travelled, which, in the 
present context, is ds/dt, the rate of increase of arc length along the path of P. 
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(0, 0) 


FIGURE 2.3 Particle P moves with constant speed u 
around a circle of radius b. 


It follows that the velocity and acceleration of P at time ¢ are given by 


d 
v= — = —usin(ut/b)i + ucos(ut/b) j, 
OF = cen = eal) 
= — = —— COSLK t— — sintl : 
oat b poe oe 


We note that the speed of P, calculated from v, is 
2.3 aw) M2 
jv| = (u cos’ (ut/b) + u~ sin (ut/b)) =U, 


which is what it was specified to be. 
The magnitude of the acceleration a is given by 


1/2 


= al *(ut/b ae ca 
ja|= -s cos*(ut/b) + 7 sin* (ut /b) ? 


and, since a = —(u?/b?)r, the direction of a is opposite to that of r. This proves the 
following important result: 


Uniform circular motion 
When a particle P moves with constant speed u around a fixed circle with centre 


—_— 
O and radius b, its acceleration vector is in the direction PO and has constant 
magnitude u?/b. 


This result is consistent with the general formula (2.5). In this special case, we have 
v =uand p = b 50 that dv/dt = 0 and a = (u/b)n. 
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6 


FIGURE 2.4 The plane polar co-ordinates r, 0 
of the point P and the polar unit vectors F 
and 0 at P. 


Example 2.5 Uniform circular motion 


A body is being whirled round at 10 ms~! on the end of a rope. If the body moves 
on a circular path of 2 m radius, find the magnitude and direction of its acceleration. 


Solution 


The acceleration is directed towards the centre of the circle and its magnitude is 
10*/2 = 50 ms~|, five times the acceleration due to Earth’s gravity! 


2.3. PARTICLE MOTION IN POLAR CO-ORDINATES 


When a particle is moving in a plane, it is sometimes very convenient to use polar 
co-ordinates 7, 6 in the analysis of its motion; the case of circular motion is an obvious 
example. Less obviously, polar co-ordinates are used in the analysis of the orbits of the 
planets. This famous problem stimulated Newton to devise his laws of mechanics. 

Figure 2.4 shows the polar co-ordinates r, 6 of a point P and the polar unit vectors 
T, 1) at P. The directions of the vectors 7 and 6 are called the radial and transverse 
directions respectively at the point P. As P moves around, the polar unit vectors do not 
remain constant. They have constant magnitude (unity) but their directions depend on the 
6 co-ordinate of P; they are however independent of the 7 co-ordinate.* In other words, 
F, 6 are vector functions of the scalar variable 0. 

We will now evaluate the two derivatives dr/dé, d0/ dé. These will be needed when 
we derive the formulae for the velocity and acceleration of P in polar co-ordinates. First 
we expand’ F, @ in terms of the Cartesian basis vectors {i, j}. This gives 


TF =cosdi+sinéd j, (2.6) 
6 = —sinOi +cosé j. (2.7) 


Since 7, @ are now expressed in terms of the constant vectors i, j, the differentiations 
with respect to 6 are simple and give 


* Tf this is not clear, sketch the directions of the polar unit vectors for P in a few different positions. 
* Recall that any vector V lying in the plane of i, j can be expanded in the form V = ai + f j, where the 
coefficients a, 6 are the components of V in the i- and j-directions respectively. 
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dO 
do 


SI 5 
t 
2) 
| 


(2.8) 


Suppose now that P is a moving particle with polar co-ordinates 7, 6 that are functions 
of the time ¢. The position vector of P relative to O has magnitude O P = r and direction 
F and can therefore be written 


r=rrf. (2.9) 


In what follows, one must distinguish carefully between the position vector r, which is 
— 


the vector OP, the co-ordinate r, which is the distance O P, and the polar unit vector T. 
To obtain the polar formula for the velocity of P, we differentiate formula (2.9) with 


dr Perey 
p= = Soe (+ err (Z) (2.10) 


Par (=) (2.11) 
dt 


respect to ft. This gives 


We will use the dot notation for time derivatives throughout this section; 7 means dr /dt, 
6 means d@/dt, # means d?r/dt? and 6 means d6/dt?. 
Now fF is a function of @ which is, in its turn, a function of t. Hence, by the chain rule 
and formula (2.8), 
BU oe peg. 
dt do dt 


If we now substitute this formula into equation (2.11) we obtain 
v =F + (6) 6, (2.12) 


which is the polar formula for the velocity of P. 
To obtain the polar formula for acceleration, we differentiate the velocity formula 
(2.12) with respect to t. This gives* 


dv 
eG += ((r6)6) 


~~ 


sis dr A Ne _s\ 40 
Sree ree 2 e)e sr) 


a dr do dd do 
=fFr+r (F 4 Garo) (Fs 4 


=rrt+ (6) 04 (70 + r6)0— (6°)? 


= (i - rP)F+ (r6 + 276) 0, 


* Bea hero. Obtain this formula yourself without looking at the text. 
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which is the polar formula for the acceleration of P. These results are summarised 
below: 


Polar formulae for velocity and acceleration 


If a particle is moving in a plane and has polar coordinates 7, 0 at time ¢, then its 
velocity and acceleration vectors are given by 


v =F + (6) 6, (2.13) 
a= (i —r6?) 7+ (r6 + 276) 6. (2.14) 


The formula (2.13) shows that the velocity of P is the vector sum of an outward radial 
velocity 7 and a transverse velocity 6; in other words v is just the sum of the velocities 
that P would have ifr and 6 varied separately. This is not true for the acceleration as it will 
be observed that adding together the separate accelerations would not yield the term 2760. 
This ‘Coriolis term’ is certainly present however, but is difficult to interpret intuitively. 


Example 2.6 Velocity and acceleration in polar coordinates 


A particle sliding along a radial groove in a rotating turntable has polar coordinates at 
time ¢ given by 


r=ct 6= Qt, 


where c and Q are positive constants. Find the velocity and acceleration vectors of 
the particle at time ¢ and find the speed of the particle at time f. 

Deduce that, for ¢ > 0, the angle between the velocity and acceleration vectors is 
always acute. 


Solution 


From the polar formulae (2.13), (2.14) for velocity and acceleration, we obtain 
v=cF+ (ct)20 =c (F+ Qr6) 
and 
a= (0- (€1)2?)F + (0 + 2cQ)O = cQ (AF + 28). 


The speed of the particle at time f is thus given by |v| = c (1 + 2777) uae 


To find the angle between v and a, consider 


va =CQ(—Qt + 2A) = C7 Nt 
>0 


for t > 0. Hence, for t > 0, the angle between v and a is acute. Hl 
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General circular motion 


An important application of polar coordinates is to circular motion. We have already 
considered the special case of uniform circular motion, but now we suppose that P moves 
in any manner (not necessarily with constant speed) around a circle with centre O and 
radius b. If we take O to be the origin of polar coordinates, the condition r = b implies 
that 7 = 7 = 0 and the formula (2.13) for the velocity of P reduces to 


v = (b6) 0. (2.15) 


This result is depicted in Figure 2.5. The transverse velocity component b6 (which is not 
necessarily the speed of P since 6 may be negative) is called the circumferential velocity 
of P. Circumferential velocity will be important when we study the motion of a rigid 
body rotating about a fixed axis; in this case, each particle of the rigid body moves on a 
circular path. 

The corresponding formula for the acceleration of P is 


(0- 16°) F+ + (6 +0)6 


) 
I 


= — (067) F + + (06) 0 


2 
Vv A 
—|— 00 
(Fre 


where v is the circumferential velocity b6. These results are summarised below: 


General circular motion 


Suppose a particle P moves in any manner around the circle r = b, where r, 0 are 
plane polar coordinates. Then the velocity and acceleration vectors of P are given 
by 


v=v0, (2.16) 
vy A 
= aN AN ee (2.17) 


where v (= b6) is the circumferential velocity of P. 


The formula (2.17) shows that, in general circular motion, the acceleration of P is the 
(vector) sum of an inward radial acceleration v? /b and a transverse acceleration v. This 
is consistent with the general formula (2.5). Indeed, what the formula (2.5) says is that, 
when P moves along a completely general path, its acceleration vector is the same as if it 
were moving on the circle of curvature at each point of its path. 
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FIGURE 2.5 The particle P moves on the circle with centre O 
and radius b. At time ¢ its angular displacement is 6 and its 
circumferential velocity is bd. 


Example 2.7 Pendulum motion 


The bob of a certain pendulum moves on a vertical circle of radius b and, when the 
string makes an angle 6 with the downward vertical, the circumferential velocity v of 
the bob is given by 


y= 2¢bcos6, 


where g is a positive constant. Find the acceleration of the bob when the string makes 
angle 6 with the downward vertical. 


Solution 
From the acceleration formula (2.17), we have 
v2 Aw A 

a=— (F)r+ v0 = — (2¢cos0)F4+ vO. 
It remains to express 0 in terms of 9. On differentiating the formula v? = 2gb cos 0 
with respect to ¢, we obtain 

2vb = — (2gb sind) 6, 
and, since bd = v, we find that 
v = —g sing. 


Hence the acceleration of the bob when the string makes angle 0 with the downward 
vertical is 


a = — (2g cos 0)? — (g sin6) 0. ™ 


2.4 RIGID BODY ROTATING ABOUT A FIXED AXIS 


Some objects that we find in everyday life, such as a brick or a thick steel rod, 
are so difficult to deform that their shape is virtually unchangeable. We model such an 
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FIGURE 2.6 The rigid body 6 rotates about the fixed axis Oz and has angular 
displacement @ at time t. Each particle P of S moves on a circular path; the point 
Po is the reference position of P. 


object by a rigid body, a collection of particles forming a perfectly rigid framework. Any 
motion of the rigid body must maintain this framework. 

An important type of rigid body motion is rotation about a fixed axis; a spinning fan, 
a door opening on its hinges and a playground roundabout are among the many examples 
of this type of motion. Suppose 6 is a rigid body which is constrained to rotate about 
the fixed axis Oz as shown in Figure 2.6. (This means that the particles of 6 that lie on 
Oz are held fixed. Rotation about Oz is then the only motion of 6 consistent with rigid- 
ity.) At time t, B has angular displacement 6 measured from some reference position. 
The angular displacement 6 is the rotational counterpart of the Cartesian displacement x 
of a particle in straight line motion. By analogy with the rectilinear case, we make the 
following definitions: 


Definition 2.4 Angular velocity The angular velocity w of B is defined to be w = 
d@/dt and the absolute value of w is called the angular speed of B. 


Units. Angular velocity (and angular speed) are measured in radians per second (rad s~!). 


Example 2.8 Spinning crankshaft 1 


The crankshaft of a motorcycle engine is spinning at 6000 revolutions per minute. 
What is its angular speed in S.I. units? 


Solution 
6000 revolutions per minute is 100 revolutions per second which is 200z radians per 
second. This is the angular speed in S.I. units. 


Particle velocities in a rotating rigid body 

In rotational motion about a fixed axis, each particle P of 6 moves on a circle of some 
radius p, where p is the (fixed) perpendicular distance of P from the rotation axis. It then 
follows from (2.16) that the circumferential velocity v of P is given by 9, that is 


eae (2.18) 
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Example 2.9 Spinning crankshaft 2 


In the crankshaft example above, find the speed of a particle of the crankshaft that 
has perpendicular distance 5 cm from the rotation axis. Find also the magnitude of its 
acceleration. 


Solution 


In this case, |w| = 2002 and p = 1/20 so that the particle speed (the magnitude of 


the circumferential velocity v) is 107 ~ 31.4ms—!. 


Since the circumferential velocity is constant, |a| = v*/p = (10s)?/0.05 ~ 
2000 ms~*, which is two hundred times the value of the Earth’s gravitational accel- 
eration! 


2.5 RIGID BODY IN PLANAR MOTION 


We now consider a more general form of rigid body motion called planar motion. 


Definition 2.5 Planar motion A rigid body B is said to be in planar motion if each 
particle of B moves in a fixed plane and all these planes are parallel to each other. 


Planar motion is quite common. For instance, any flat-bottomed rigid body sliding on a 
flat table is in planar motion. Another example is a circular cylinder rolling on a rough flat 
table. 

The particle velocities in planar motion can be calculated by the following method; 
the proof is given in Chapter 16. First select some particle C of the body as the reference 
particle. The velocity of a general particle P of the body is then the vector sum of 


(i) a translational contribution equal to the velocity of C (as if the body did not 
rotate) and 

(ii) a rotational contribution (as if C were fixed and the body were rotating with 
angular velocity w about a fixed axis through C). 


This result is illustrated in Figure 2.7, where the body is a rectangular plate and the refer- 
ence particle C is at a corner of the plate. The velocity v of P is given by v = v© 4+ v*, 
where the translational contribution v© is the velocity of C and the rotational contribution 
v® is caused by the angular velocity w about C. Although the reference particle can be 
any particle of the body, it is usually taken to be the centre of mass or centre of symmetry 
of the body. 


Example 2.10 The rolling wheel 


A circular wheel of radius 6 rolls in a straight line with speed u on a fixed horizontal 
table. Find the velocities of its particles. 


Solution 


This is an instance of planar motion and so the particle velocities can be found by 
the method above. Let the position of the wheel at some instant be that shown in 
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FIGURE 2.7 The velocity of the particle P belonging to 
the rigid body 6 is the sum of the translational 
contribution v© and the rotational contribution v*. The 
reference particle C can be any particle of the body. 


FIGURE 2.8 The circular wheel rolls from left to right on a fixed horizontal 
table. The reference particle C is taken to be the centre of the wheel and 
the velocity of a typical particle P is the sum of the two velocities shown. 


Figure 2.8. The reference particle C is taken to be the centre of the wheel, and the 
wheel is supposed to have some angular velocity w about C. The velocity v’ of a 
typical particle P is then the sum of the two velocities shown. In terms of the vectors 
{i, J} 
v? =uit wp (cos @i — sin j) 
= (u+a@pcos@)i — (wp sin®) j. (2.19) 


In particular, on taking p = b and 6 = 7, the velocity v2 of the contact particle Q is 
given by 


v2 =(u—wb)i. (2.20) 
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If the wheel is allowed to slip as it moves across the table, there is no restriction on 
v2 so that w and w are unrelated. But rolling, by definition, requires that 


v2 =0. (2.21) 


On applying this rolling condition to our formula (2.20) for v2, we find that w must 
be related to u by 


(0.29) 


and on using this value of w in (2.19) we find that the velocity of the typical particle 
P is given by 


wv =u (1 ey * cos) i—u (5 sin) i. (2.23) 
When P lies on the circumference of the wheel, this formula simplifies to 
v? =u(14+cos@) i—usind j, (2.24) 
in which case the speed of P is given by 
|v” | = 2ucos(6/2), (—1 <0 <n). 


Thus the highest particle of the wheel has the largest speed, 2u, while the contact 
particle has speed zero, as we already know. @ 


2.6 REFERENCE FRAMES IN RELATIVE MOTION 


A reference frame is simply a rigid coordinate system that can be used to specify 
the positions of points in space. In practice it is convenient to regard a reference frame 
as being embedded in, or attached to, some rigid body. The most familiar case is that in 
which the rigid body is the Earth but it could instead be a moving car, or an orbiting space 
station. In principle, any event, the motion of an aircraft for example, can be observed 
from any of these reference frames and the motion will appear different to each observer. 
It is this difference that we now investigate. 

Let the motion of a particle P be observed from the reference frames F {O; i, j, k} 
and F’{O’;i, j, k} as shown in Figure 2.9. Here we are supposing that the frame F’ 
does not rotate relative to F. This is why, without losing generality, we can suppose that 
F and F’ have the same set of unit vectors {i, j, k}. For example, P could be an aircraft, 
F could be attached to the Earth, and F’ could be attached to a car driving along a straight 
road. 

Then, r, r’, the position vectors of P relative to F, F’ are connected by 


r=r+D, (2.25) 


where D is the position vector of O’ relative to F. 
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FIGURE 2.9 The particle P is observed from the two reference frames F and F’. 


We now differentiate this equation with respect to f, a step that requires some care. 
Let us consider the rates of change of the vectors in equation (2.25), as observed from the 
frame F. Then 


v= a). +V, (2.26) 


where v is the velocity of P observed in F and V is the velocity of F’ relative to F. 
Now when two different reference frames are used to observe the same vector, the 
observed rates of change of that vector will generally be different. In particular, it is not 


(=) -(<) 
dt) r dt } rr 


However, as we will show in Chapter 17, these two rates of change are equal if the frame 


generally true that 


F does not rotate relative to F. Hence, in our case, we do have 


(¢ a ) (¢ r’ ) : 
— — — ie ‘ 
dt )¢ dt } 
where v’ is the velocity of P observed in F’. 

Equation (2.26) can then be written 


v=uv+V (2.27) 


Thus the velocity of P observed in F is the sum of the velocity of P observed in F' and 
the velocity of the frame F' relative to F. This result applies only when F’ does not 
rotate relative to F. 
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This is the well known rule for handling ‘relative velocities’. In the aircraft example, 
it means that the true velocity of the aircraft (relative to the ground) is the vector sum of 
(i) the velocity of the aircraft relative to the car, and (11) the velocity of the car relative to 
the road. 


Example 2.11 Relative velocity 


The Mississippi river is a mile wide and has a uniform flow. A steamboat sailing at 
full speed takes 12 minutes to cover a mile when sailing upstream, but only 3 minutes 
when sailing downstream. What is the shortest time in which the steamboat can cross 
the Mississippi to the nearest point on the opposite bank? 


Solution 


The way to handle this problem is to view the motion of the boat from a reference 
frame F’ moving with the river. In this reference frame the water is at rest and the 
boat sails with the same speed in all directions. The relative velocity formula (2.27) 
then gives us the true picture of the motion of the boat relative to the river bank, which 
is the reference frame F. 

Let u® be the speed of the boat in still water and u* be the speed of the river, both 
measured in miles per hour. The upstream and downstream times are just a sneaky 
way of telling us the values of u? and u*. When the boat sails downstream, (2.27) 
implies that its speed relative to the bank is vu? + u*. But this speed is stated to be 
1/3 mile per minute (or 20 miles per hour). Hence 


uP +u% = 20. 


B 


Similarly the upstream speed is u? — u* and is stated to be 1/12 mile per minute (or 


5 miles per hour). Hence 
u® —u*® =5. 
Solving these equations yields 
u® = 12.5 mph, u® = 7.5 mph. 


Now the boat must cross the river. In order to cross by a straight line path to the 
nearest point on the opposite bank, the boat’s velocity (relative to the water) must be 
directed at some angle @ to the required path (as shown in Figure 2.10) so that its 
resultant velocity is perpendicular to the stream. For this to be true, w must satisfy 


u? sina = a”. 
which gives sina = 3/5. The resultant speed of the boat when crossing the river is 
therefore uw? cosa = 12.5 x (4/5) = 10 mph. Since the river is one mile wide, the 
time taken for the crossing is 1/10 hour = 6 minutes. 
The relative velocity formula (2.27) can be differentiated again with repect to ¢ to give 
a similar connection between accelerations. The result is that 


a=a'+A, (2.28) 
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FIGURE 2.10 The river flows from left to right with speed u* and the boat sails with speed 
u® relative to the river. In each case the velocity of the boat relative to the bank is the 
vector sum of the two velocities shown. 


where a and a’ are the accelerations of P relative to the frames F and F’ respectively, 
and A is the acceleration of the frame F’ relative to the frame F. Once again, this result 
applies only when F’ does not rotate relative to F. 


Mutually unaccelerated frames 


An important special case of equation (2.28) occurs when the frame F’ is moving with 
constant velocity (and no rotation) relative to F. We will then say that F and F’ are 
mutually unaccelerated frames. In this case A = 0 and (2.28) becomes 


a=a'. (2.29) 


This means that when mutually unaccelerated frames are used to observe the motion of a 
particle P,, the observed acceleration of P is the same in each frame. 
This result will be vital in our discussion of inertial frames in Chapter 3. 


Problems on Chapter 2 


Answers and comments are at the end of the book. 


Harder problems carry a star (+). 


Rectilinear particle motion 


2.1 A particle P moves along the x-axis with its displacement at time ¢ given by x = 6t? — 
t? + 1, where x is measured in metres and t in seconds. Find the velocity and acceleration of 
P at time f. Find the times at which P is at rest and find its position at these times. 


2.2 A particle P moves along the x-axis with its acceleration a at time ¢ given by 


a= 6t—4ms~”. 
Initially P is at the point x = 20m and is moving with speed 15 ms~! in the negative x- 
direction. Find the velocity and displacement of P at time ¢. Find when P comes to rest and 
its displacement at this time. 
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2.3 Constant acceleration formulae A particle P moves along the x-axis with constant 
acceleration a in the positive x-direction. Initially P is at the origin and is moving with velocity 
u in the positive x-direction. Show that the velocity v and displacement x of P at time ¢ are 
given by* 


v=u-+at, x =ut + hat’, 


and deduce that 


v=w + 2ax. 


In a standing quarter mile test, the Suzuki Bandit 1200 motorcycle covered the quarter 
mile (from rest) in 11.4 seconds and crossed the finish line doing 116 miles per hour. Are 
these figures consistent with the assumption of constant acceleration? 


General particle motion 


2.4 The trajectory of a charged particle moving in a magnetic field is given by 
r=bcosQti+bsinQt j +ctk, 


where b, 2 and c are positive constants. Show that the particle moves with constant speed and 
find the magnitude of its acceleration. 


2.5 Acceleration due to rotation and orbit of the Earth A body is at rest at a location on the 
Earth’s equator. Find its acceleration due to the Earth’s rotation. [Take the Earth’s radius at 
the equator to be 6400 km.] 

Find also the acceleration of the Earth in its orbit around the Sun. [Take the Sun to be 
fixed and regard the Earth as a particle following a circular path with centre the Sun and radius 
15 x 10!° m. 


2.6 An insect flies on a spiral trajectory such that its polar coordinates at time ¢ are given by 
r= be, g=, 


where b and Q2 are positive constants. Find the velocity and acceleration vectors of the insect 
at time f, and show that the angle between these vectors is always 7/4. 


2.7 A racing car moves on a circular track of radius b. The car starts from rest and its speed 
increases at a constant rate w. Find the angle between its velocity and acceleration vectors at 
time f. 


* These are the famous constant acceleration formulae. Although they are a mainstay of school mechan- 
ics, we will make little use of them since, in most of the problems that we treat, the acceleration is not 
constant. It is a serious offence to use these formulae in non-constant acceleration problems. 
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2.8 A particle P moves ona circle with centre O and radius b. At a certain instant the speed 
of P is v and its acceleration vector makes an angle a with PO. Find the magnitude of the 
acceleration vector at this instant. 


2.9% A bee flies on a trajectory such that its polar coordinates at time ¢ are given by 
bt t 
r=—+z(Qt—-t) 6=-— (0<t<2r), 
T T 


where b and T are positive constants. Find the velocity vector of the bee at time f. 
Show that the least speed achieved by the bee is b/t. Find the acceleration of the bee at 
this instant. 


2.10% A pursuit problem: Daniel and the Lion The luckless Daniel (D) is thrown into a 
circular arena of radius a containing a lion (L). Initially the lion is at the centre O of the 
arena while Daniel is at the perimeter. Daniel’s strategy is to run with his maximum speed u 
around the perimeter. The lion responds by running at its maximum speed U in such a way 
that it remains on the (moving) radius OD. Show that r, the distance of L from O, satisfies 


the differential equation 
2 u2 (U2a2 2 
r=—([———r*}. 
az \ uw 


Find r as a function of ¢. If U > u, show that Daniel will be caught, and find how long this 
will take. 

Show that the path taken by the lion is an arc of a circle. For the special case in which 
U =u, sketch the path taken by the lion and find the point of capture. 


2.11 General motion with constant speed A particle moves along any path in three- 
dimensional space with constant speed. Show that its velocity and acceleration vectors must 
always be perpendicular to each other. [Hint. Differentiate the formula v - v = v* with respect 
to ¢.] 


2.12 A particle P moves so that its position vector r satisfies the differential equation 
r=cxr, 


where c is a constant vector. Show that P moves with constant speed on a circular path. [Hint. 
Take the dot product of the equation first with ¢ and then with r.] 


Angular velocity 


2.13 A large truck with double rear wheels has a brick jammed between two of its tyres which 
are 4 ft in diameter. If the truck is travelling at 60 mph, find the maximum speed of the brick 
and the magnitude of its acceleration. [Express the acceleration as a multiple of g = 32 fts~?.] 


2.14 A particle is sliding along a smooth radial grove in a circular turntable which 1s rotating 
with constant angular speed (2. The distance of the particle from the rotation axis at time ¢ is 
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FIGURE 2.11 Cam and valve mechanism 


Caen 


FIGURE 2.12 Crank and piston mechanism 


observed to be 
r = bcosh Qt 


for t > 0, where b is a positive constant. Find the speed of the particle (relative to a fixed 
reference frame) at time ¢, and find the magnitude and direction of the acceleration. 


2.15 Figure 2.11 shows an eccentric circular cam of radius b rotating with constant angular 
velocity w about a fixed pivot O which is a distance e from the centre C. The cam drives a 
valve which slides in a straight guide. Find the maximum speed and maximum acceleration of 
the valve. 


2.16 Figure 2.12 shows a piston driving a crank O P pivoted at the end O. The piston slides 
in a straight cylinder and the crank is made to rotate with constant angular velocity w. Find the 
distance OQ in terms of the lengths b, c and the angle 0. Show that, when b/c is small, OQ 
is given approximately by 


2 


b 
OQ =c+bcosé — — sin’ 6, 
2c 


on neglecting (b/c)* and higher powers. Using this approximation, find the maximum accel- 
eration of the piston. 


2.17 Figure 2.13 shows an epicyclic gear arrangement in which the ‘sun’ gear G, of radius 
b, and the ‘ring’ gear G2 of inner radius b2 rotate with angular velocities w1, w2 respectively 
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© : 
FIGURE 2.13 Epicyclic gear mechanism 
Y 
C 
FIGURE 2.14 The pins P and Q at the ends Ww P 
of a rigid link move along the axes OX, OY > X 
~ a > 


respectively. 


about their fixed common centre O. Between them they grip the ‘planet’ gear G, whose centre 
C moves on a circle centre O. Find the circumferential velocity of C and the angular velocity 
of the planet gear G. If O and C were connected by an arm pivoted at O, what would be the 
angular velocity of the arm? 


2.18 Figure 2.14 shows a straight rigid link of length a whose ends contain pins P, Q that 
are constrained to move along the axes OX, OY. The displacement x of the pin P at time ¢ 
is prescribed to be x = bsin Qt, where b and Q are positive constants with b < a. Find the 
angular velocity w and the speed of the centre C of the link at time r. 


Relative velocity 


2.19 An aircraft is to fly from a point A to an airfield B 600 km due north of A. If a steady 
wind of 90 km/h is blowing from the north-west, find the direction the plane should be pointing 
and the time taken to reach B if the cruising speed of the aircraft in still air is 200 km/h. 


2.20 An aircraft takes off from a horizontal runway with constant speed U, climbing at a 
constant angle @ to the horizontal. A car is moving on the runway with constant speed u 
directly towards the front of the aircraft. The car is distance a from the aircraft at the instant 
of take-off. Find the distance of closest approach of the car and aircraft. [Don’t try this one at 
home. ] 


2.21% An aircraft has cruising speed v and a flying range (out and back) of Ro in still air. 
Show that, in a north wind of speed u (u < v) its range in a direction whose true bearing from 
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FIGURE 2.15 The dog D chases the hare H 
by running directly towards the hare’s current 
position. 


north is 6 is given by 


Ro(v? — u?) 
v(v2 — u? sin? 61/2 


What is the maximum value of this range and in what directions is it attained? 


Computer assisted problems 


2.22 Dog chasing a hare; another pursuit problem. Figure 2.15 shows a dog with position 
vector r? and velocity v? chasing a hare with position vector r” and velocity v/. The dog’s 
strategy is to run directly towards the current position of the hare. Given the motion of the hare 
and the speed of the dog, what path does the dog follow? 


Since the dog runs directly towards the hare, its velocity v? must satisfy 


D HH D 


v r = 
yo jr — P| 


In terms of the position vector of the dog relative to the hare, given by R = rP —r"1 this equation becomes 


R= SP _ oH, 
R 
Given the velocity v/ of the hare and the speed v of the dog as functions of time, this differential equation 
determines the trajectory of the dog relative to the hare; capture occurs when R = 0. The actual trajectory 
of the dog is given by r? = R+r". 
If the motion takes place in a plane with R = Xi+Y j then X and Y satisfy the coupled differential 
equations 


vPX H a vPy ae 
; ~~ (x2 4. y2y1/2 y? 


~ (O24 y212 


together with initial conditions of the form X(0) = xo and Y(0) = yo. Such equations cannot usually 
be solved analytically but are extremely easy to solve with computer assistance. Two interesting cases to 
consider are as follows. In each case the speeds of the dog and the hare are constants. 


(i) Initially the hare is at the origin with the dog at some point (xg, yg). The hare then runs along the 
positive x-axis and is chased by the dog. Show that the hare gets caught if vP > v4, but when v? = v4 
the dog always misses (unless he starts directly in the path of the hare). This remarkable result can be 


proved analytically. 
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(ii) The hare runs in a circle (like the lion problem). In this case, with yP = yl , the dog seems to 
miss no matter where he starts. 


Try some examples of your own and see if you can find interesting paths taken by the dog. 


2.23 Consider further the piston problem described in Problem 2.16. Use computer assistance 
to calculate the exact and approximate accelerations of the piston as functions of 8. Compare 
the exact and approximate formulae (non-dimensionalised by w*b) by plotting both on the 
same graph against 0. Show that, when b/c < 0.5, the two graphs are close, but when b/c 
gets close to unity, large errors occur. 


Chapter Three 


Newton’s laws of motion 
and the law of gravitation 


KEY FEATURES 
The key features of this chapter are Newton’s laws of motion, the definitions of mass and 
force, the law of gravitation, the principle of equivalence, and gravitation by spheres. 


This chapter is concerned with the foundations of dynamics and gravitation. Kine- 
matics is concerned purely with geometry of motion, but dynamics seeks to answer the 
question as to what motion will actually occur when specified forces act on a body. The 
rules that allow one to make this connection are Newton’s laws of motion. These are 
laws of physics that are founded upon experimental evidence and stand or fall accord- 
ing to the accuracy of their predictions. In fact, Newton’s formulation of mechanics has 
been astonishingly successful in its accuracy and breadth of application, and has survived, 
essentially intact, for more than three centuries. The same is true for Newton’s universal 
law of gravitation which specifies the forces that all masses exert upon each other. 

Taken together, these laws represent virtually the entire foundation of classical 
mechanics and provide an accurate explanation for a vast range of motions from large 
molecules to entire galaxies. 


3.1 NEWTON’S LAWS OF MOTION 


Isaac Newton’s* three famous laws of motion were laid down in Principia, written 
in Latin and published in 1687. These laws set out the founding principles of mechanics 
and have survived, essentially unchanged, to the present day. Even when translated into 
English, Newton’s original words are hard to understand, mainly because the terminology 


* Sir Isaac Newton (1643-1727) is arguably the greatest scientific genius of all time. His father was com- 
pletely uneducated and Isaac himself had no contact with advanced mathematics before the age of twenty. 
However, by the age of twenty seven, he had been appointed to the Lucasian chair at Cambridge and was 
one of the foremost scientists in Europe. His greatest achievements were his discovery of the calculus, his 
laws of motion, and his theory of universal gravitation. On the urging of Halley (the Astronomer Royal), 
Newton wrote up an account of his new physics and its application to astronomy. Philosophiae Naturalis 
Principia Mathematica was published in 1687 and is generally recognised as the greatest scientific book 
ever written. 
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of the seventeenth century is now archaic. Also, the laws are now formulated as applying 
to particles, a concept never used by Newton. A particle is an idealised body that occupies 
only a single point of space and has no internal structure. True particles do not exist’ in 
nature, but it is convenient to regard realistic bodies as being made up of particles. Using 
modern terminology, Newton’s laws may be stated as follows: 


Newton’s laws of motion 


First Law When all external influences on a particle are removed, the particle 
moves with constant velocity. [This velocity may be zero in which case the particle 
remains at rest.] 


Second Law When a force F acts on a particle of mass m, the particle moves with 
instantaneous acceleration a given by the formula 


F =ma, 


where the unit of force is implied by the units of mass and acceleration. 


Third Law When two particles exert forces upon each other, these forces are (1) 
equal in magnitude, (ii) opposite in direction, and (111) parallel to the straight line 
joining the two particles. 


Units 


Any consistent system of units can be used. The standard scientific units are SI units in 
which the unit of mass is the kilogram, the unit of length is the metre, and the unit of 
time is the second. The unit of force implied by the Second Law is called the newton, 
and written N. An excellent description of the SI system of units can be found on 


http://www.physics.nist.gov/PhysRefData 


the website of the US National Institute of Standards & Technology. 

In the Imperial system of units, the unit of mass is the pound, the unit of length is the 
foot, and the unit of time is the second. The unit of force implied by the Second Law is 
called the poundal. These units are still used in some industries in the US, a fact which 
causes frequent confusion. 


Interpreting Newton’s laws 


Newton’s laws are clear enough in themselves but they leave some important questions 
unanswered, namely: 


(i) In what frame of reference are the laws true? 


+ The nearest thing to a particle is the electron, which, unlike other elementary particles, does seem to be a 
point mass. The electron does however have an internal structure, having spin and angular momentum. 
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(ii) What are the definitions of mass and force? 


These questions are answered in the sections that follow. What we do is to set aside 
Newton’s laws for the time being and go back to simple experiments with particles. These 
are ‘thought experiments’ in the sense that, although they are perfectly meaningful, they 
are unlikely to be performed in practice. The supposed ‘results’ of these experiments are 
taken to be the primitive governing laws of mechanics on which we base our definitions of 
mass and force. Finally, these laws and definitions are shown to be equivalent to Newton’s 
laws as stated above. This process could be said to provide an interpretation of Newton’s 
laws. The interpretation below is quite sophisticated and is probably only suitable for 
those who have already seen a simpler account, such as that given by French [3]. 


3.2 INERTIAL FRAMES AND THE LAW OF INERTIA 


The first law states that, when a particle is unaffected by external influences, it 
moves with constant velocity, that is, it moves in a straight line with constant speed. 
Thus, contrary to Aristotle’s view, the particle needs no agency of any kind to maintain 
its motion.* Since the influence of the Earth’s gravity rules out any verification of the 
First Law by an experiment conducted on Earth, Newton showed remarkable insight in 
proposing a law he could not possibly verify. In order to verify the First Law, all external 
influences must be removed, which means that we must carry out our thought experiment 
in a place as remote as possible from any material bodies, such as the almost empty space 
between the galaxies. In our minds then we go to such a place armed with a selection of 
test particles’ which we release in various ways and observe their motion. According to 
the First Law, each of these particles should move with constant velocity. 


Inertial reference frames 


So far we have ignored the awkward question as to what reference frame we should use to 
observe the motion of our test particles. When confronted with this question for the first 
time, one’s probable response is that the reference frame should be ‘fixed’. But fixed to 
what? The Earth rotates and is in orbital motion around the Sun. Our entire solar system 
is part of a galaxy that rotates about its centre. The galaxies themselves move relative 
to each other. The fact is that everything in the universe is moving relative to everything 
else and nothing can properly be described as fixed. From this it might be concluded that 
any reference frame is as good as any other, but this is not so, for, if the First Law is 
true at all, it can only be true in certain special reference frames. Suppose for instance 
that the First Law has been found to be true in the reference frame F. Then it is also 
true in any other frame F’ that is mutually unaccelerated relative to F (see section 2.6). 
This follows because, if the test particles have constant velocities in F, then they have 


* Such a law was proposed prior to Newton by Galileo but, curiously, Galileo did not accept the conse- 
quences of his own statement. 
* Since true particles do not exist, we will have to make do with uniform rigid spheres of various kinds. 
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zero accelerations in F. But, since F and F’ are mutually unaccelerated frames, the 
test particles must have zero accelerations in F’ and thus have constant velocities in F’. 
Moreover, the First Law does not hold in any other reference frame. 


Definition 3.1 Inertial frame A reference frame in which the First Law is true is said 
to be an inertial frame. 


It follows that, if there exists one inertial frame, then there exist infinitely many, with 
each frame moving with constant velocity (and no rotation) relative to any other. 

It may appear that the First Law is without physical content since we are saying that 
it is true in those reference frames in which it is true. However, this is not so since inertial 
frames need not have existed at all, and the fact that they do is the real physical content of 
the First Law. Why there should exist this special class of reference frames in which the 
laws of physics take simple forms is a very deep and interesting question that we do not 
have to answer here! 

Our discussion is summarised by the following statement which we take to be a law 
of physics: 


The law of inertia There exists in nature a unique class of mutually unaccelerated refer- 
ence frames (the inertial frames) in which the First Law is true. 


Practical inertial frames 


The preceding discussion gives no clue as to how to set up an inertial reference frame and, 
in practice, exact inertial frames are not available. Practical reference frames have to be tied 
to real objects that are actually available. The most common practical reference frame is the 
Earth. Such a frame is sufficiently close to being inertial for the purpose of observing most 
Earth-bound phenomena. The orbital acceleration of the Earth is insignificant and the effect of 
the Earth’s rotation is normally a small correction. For example, when considering the motion 
of a football, a pendulum or a spinning top, the Earth may be assumed to be an inertial frame. 
However, the Earth is not a suitable reference frame from which to observe the motion of 
an orbiting satellite. In this case, the geocentric frame (which has its origin at the centre of 
mass of the Earth and has no rotation relative to distant stars) would be appropriate. Similarly, 
the heliocentric frame (which has its origin at the centre of mass of the solar system and has 
no rotation relative to distant stars) is appropriate when observing the motion of the planets. 


Example 3.1 /nertial frames 


Suppose that a reference frame fixed to the Earth is exactly inertial. Which of the 
following are then inertial frames? 


A frame fixed to a motor car which is 

(1) moving with constant speed around a flat race track, 

(ii) moving with constant speed along a straight undulating road, 
(iii) moving with constant speed up a constant gradient, 


(iv) freewheeling down a hill. 
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FIGURE 3.1 The particles P; and P move under their 
mutual interaction and, relative to an inertial reference 
frame F, have accelerations a,2 and a2, repectively. 
These accelerations are found to satisfy the law of mutual 
interaction. 


Solution 


Only (iii) is inertial. In the other cases, the frame is accelerating or rotating relative 
to the Earth. 


3.3. THE LAW OF MUTUAL INTERACTION; MASS AND 
FORCE 


We first dispose of the question of what frame of reference should be used to 
observe the particle motions mentioned in the Second Law. The answer is that any inertial 
reference frame can be used and we will always assume this to be so, unless stated 
otherwise. As stated earlier, the problem in understanding the Second and Third Laws is 
that the concepts of mass and force are not defined, which is obviously unsatisfactory. 

Our second thought experiment is concerned with the motion of a pair of mutually 
interacting particles. The nature of their mutual interaction can be of any kind* and all 
other influences are removed. Since each particle is influenced by the other, the First Law 
does not apply. The particles will, in general, have accelerations, these being independent 
of the inertial frame in which they are measured. Our second law of physics is concerned 
with the ‘observed’ values of these mutually induced accelerations. 


The law of mutual interaction Suppose that two particles P, and P2 interact with each 
other and that Pz induces an instantaneous acceleration a\2 in P,, while P, induces an 
instantaneous acceleration a2, in Py. Then 


(i) these accelerations are opposite in direction and parallel to the straight line joining 
P, and P», 


* The mutual interaction might be, for example, (i) mutual gravitation, (ii) electrostatic interaction, caused 
by the particles being electrically charged, or (iii) the particles being connected by a fine elastic cord. 
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(ii) the ratio of the magnitudes of these accelerations, \a2\|/\a\2| is a constant inde- 
pendent of the nature of the mutual interaction between P, and P2, and indepen- 
dent of the positions and velocities’ of P, and P3. 


Moreover, suppose that when Py interacts with a third particle P3 the induced accelera- 
tions are a23 and a32, and when P, interacts with P3 the induced accelerations are a,3 
and a3. Then the magnitudes of these accelerations satisfy the consistency relation* 


lazi| | |a32| — lai3| _ 


= (3.1) 
laj2| — |aa3| [a3 | 


Definition of inertial mass 


The law of mutual interaction leads us to our definitions of mass and force. The qualita- 
tive definition of the (inertial) mass of a particle is that it is a numerical measure of the 
reluctance of the particle to being accelerated. Thus, when particles P; and P2 interact, 
we attribute the fact that the induced accelerations aj2 and a2; have different magnitudes 
to the particles having different masses. This point of view is supported by the fact that the 
ratio |a21|/|a12| depends only upon the particles themselves, and not on the interaction, 
or where the particles are, or how they are moving. We define the mass ratio m,/mz of 
the particles P;, P2 to be the inverse ratio of the magnitudes of their mutually induced 
accelerations, as follows: 


Definition 3.2 Inertial mass The mass ratio m,/mz of the particles P,, Pz is defined 
to be 


mi _ |a2il 


= . (3.2) 
m2 |aj2\ 


There is however a possible inconsistency in this definition of mass ratio. Suppose 
that we introduce an third particle P3. Then, by performing three experiments, we could 
independently determine the three mass ratios m1 /m2, m2/m3 and m3/m, and there is no 
guarrantee that the product of these three ratios would be unity. However, the consistency 
relation (3.1) assures us that it would be found to be unity, and this means that the above 
definition defines the mass ratios of particles unambiguously. 

In order to have a numerical measure of mass, we simply choose some particle A as 
the reference mass (having mass one unit), in which case the mass of any other particle 
can be expressed as a number of *A-units’. If we were to use a different particle B as the 
reference mass, we would obtain a second measure of mass in B-units, but this second 
measure would just be proportional to the first, differing only by a multiplied constant. In 
SI units, the reference body (having mass one kilogram) is a cylinder of platinum iridium 
alloy kept under carefully controlled conditions in Paris. 


+ This is true when relativistic effects are negligible. 
* The significance of the consistency relation will be explained shortly. 
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Example 3.2 A strange definition of mass 


Suppose the mass ratio m2/mj, were defined in some other way, such as 


1/2 
my _ (<2) j 
m2 |ay2\ 


Is this just as good as the standard definition? 


Solution 


For some purposes it would be just as good. It would lead to the non-standard form 
F=m’a 


for the second law, and, for the motion of a single particle, the theory would be essen- 
tially unaffected. We will see later however that, if mass were defined in this way, then 
the mass of a multi-particle system would not be equal to the sum of the masses of its 
constituent particles! This is not contradictory, but it is a very undesirable feature and 
explains why the standard definition is used. @ 


Definition of force 


We now turn to the definition of force. Qualitatively, the presence of a force is the reason 
we give for the acceleration of a particle. Thus, when interacting particles cause each 
other to accelerate, we say it is because they exert forces upon each other. How do we 
know that these forces are present? Because the particles are accelerating! These state- 
ments are obviously circular and without real content. Force is therefore a quantity of 
our own invention, but a very useful one nonetheless and an essential part of the New- 
tonian formulation of classical mechanics. It should be noted though that the concept of 
force is not an essential part of the Lagrangian or Hamiltonian formulations of classical 
mechanics.* 

In mutual interactions, the forces that the particles exert upon each other are defined 
as follows: 


Definition 3.3 Force Suppose that the particles P, and P2 are in mutual interaction 
and have accelerations a,2 and a2, respectively. Then the force F \2 that Pz exerts on P, 
and the force F 2, that P, exerts on P2 are defined to be 


Fi2 =m 14)2, Fo; = 242), (3.3) 


where the unit of force is implied by the units of mass and acceleration. 


It follows that, in the case of two-particle interactions, the Second Law is true by 
the definition of force. Also, since aj2 and a2; are opposite in direction and are parallel 


* This fact is important when making connections between classical mechanics and other theories, such as 
general relativity or quantum mechanics. The concept of force does not appear in either of these theories. 
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FIGURE 3.2 The law of multiple interactions. In the presence of interactions 
from the particles P;, P2, P3, the acceleration ao of particle Po is given by 
ay = 401 + A402 + 03. 


to the line P; P2, so then are F ;2 and F; thus parts (ii) and (ii1) of the Third Law are 
automatically true. Furthermore 


|F 12] = mj|a12| = m2la21| = |F 1], 


on using the definition (3.2) of the mass ratio m,/mz. Thus part (i) of the Third Law is 
also true. Hence the law of mutual interaction, together with our definitions (3.2), (3.3) of 
mass and force, implies the truth of the Second and Third Laws. 


3.4 THE LAW OF MULTIPLE INTERACTIONS 


Our third and final thought experiment is concerned with what happens when a 
particle is subject to more than one interaction. 


The law of multiple interactions Suppose the particles Po, P\, ... Py are interacting with 
each other and that all other influences are removed. Then the acceleration ag induced in 
Po can be expressed as 


ao = 401 + ann +--+ + aon, (3.4) 


where ao, Ag2 ...are the accelerations that Po would have if the particles P,, P2, ...were 
individually interacting with Po. 

This result is sometimes expressed by saying that interaction forces act indepen- 
dently of each other. It follows that 


moag = mo (Aoi + Ao2 +--+ + aon) 
= Fo. + For +---+ Fon, 
on using the definition (3.3) of mutual interaction forces. Thus the Second Law remains 
true for multiple interactions provided that the ‘effective force’ F9 acting on Po is under- 


stood to mean the (vector) resultant of the individual interaction forces acting on Po, that 
is 


Fo = Foi + Foo +:::+ Fon. 
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This result is not always thought of as a law of physics, but it is.* It could have been 
otherwise! 


Experimental basis of Newton’s Laws 


1. We accept the law of inertia, the law of mutual interaction and the law of 
multiple interactions as the ‘experimental’ basis of mechanics. 


2. Together with our definitions of mass and force, these experimental laws imply 
that Newton’s laws are true in any inertial reference frame. 


3.5 CENTRE OF MASS 


We can now introduce the notion of the centre of mass of a collection of particles. 
Suppose we have a system of particles P,, Po, ..., Py with masses m1, ™m2,..., my, and 
Pp y Pp 
position vectors r1, r2,..., Nn respectively. Then: 


Definition 3.4 Centre of mass The centre of mass of this system of particles is the 
point of space whose position vector R is defined by 


N N 
as mir, +mory+-++++myrn _ ia Mili 7 a iri (3.5) 
my +m2++:++my yy mi M 


where M is the sum of the separate masses. 


The centre of mass of a system of particles is simply a ‘weighted’ mean of the position 
vectors of the particles, where the ‘weights’ are the particle masses. Centre of mass is an 
important concept in the mechanics of multi-particle systems. Unfortunately, there is a 
widespread belief that the centre of mass has a magical ability to describe the behaviour 
of the system in all circumstances. This is simply not true. For instance, we will show in 
the next section that it is not generally true that the total gravitational force that a system 
of masses exerts on a test mass is equal to the force that would be exerted by a particle of 
mass M situated at the centre of mass. 


Example 3.3 Finding centres of mass 


Find the centre of mass of (i) a pair of particles of different masses, (ii) three identical 
particles. 


Solution 
(i) For a pair of particles P}, P2, the position vector of the centre of mass is given by 


— mr, + mr 
mytm, ~ 


* It certainly does not follow from the observation that ‘forces are vector quantities’! 
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It follows that the centre of mass lies on the line P; P2 and divides this line in the ratio 
m2:mMy4. 

(ii) For three identical particles P|, P2, P3, the position vector of the centre of mass 
is given by 


mr, +mr2+mr3 ry trnter3 
m+m+m - 3 


It follows that the centre of mass lies at the centroid of the triangle P; P2 P3. @ 


The centres of mass of most of the systems we meet in mechanics are easily deter- 
mined by symmetry considerations. However, when symmetry is lacking, the position of 
the centre of mass has to be worked out from first principles by using the definition (3.5), 
or its counterpart for continuous mass distributions. The Appendix at the end of the book 
contains more details and examples. 


3.6 THE LAW OF GRAVITATION 


Physicists recognise only four distinct kinds of interaction forces that exist in 
nature. These are gravitational forces, electromagnetic forces and weak/strong nuclear 
forces. The nuclear forces are important only within the atomic nucleus and will not 
concer us at all. The electromagnetic forces include electrostatic attraction and repul- 
sion, but we will encounter them mainly as ‘forces of contact’ between material bod- 
ies. Since such forces are intermolecular, they are ultimately electromagnetic although 
we will make no use of this fact! The present section however is concerned with 
gravitation. 

It is an observed fact that any object with mass attracts any other object with mass 
with a force called gravitation. When gravitational interaction occurs between particles, 
the Third Law implies that the interaction forces must be equal in magnitude, opposite 
in direction and parallel to the straight line joining the particles. The magnitude of the 
gravitational interaction forces is given by: 


The law of gravitation 


The gravitational forces that two particles exert upon each other each have magni- 
tude 


m\m2G 


ee (3.6) 


where 1, m2 are the particle masses, R is the distance between the particles, and 
G, the constant of gravitation, is a universal constant. Since G is not dimensionless, 
its numerical value depends on the units of mass, length and force. 
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This is the famous inverse square law of gravitation originally suggested by Robert 
Hooke,* a scientific contemporary (and adversary) of Newton. In SI units, the constant of 
gravitation is given approximately by 


G =6.67x107'! Nm’*kg~?, (3.7) 


this value being determined by observation and experiment. There is presently no theory 
(general relativity included) that is able to predict the value of G. Indeed, the theory of 
general relativity does not exclude repulsion between masses! 

To give some idea of the magnitudes of the forces involved, suppose we have two 
uniform spheres of lead, each with mass 5000 kg (five metric tons). Their common radius 
is about 47 cm which means that they can be placed with their centres 1 m apart. What 
gravitational force do they exert upon each other when they are in this position? We 
will show later that the gravitational force between uniform spheres of matter is exactly 
the same as if all the mass of each sphere were concentrated at its centre. Given that 
this result is true, we can find the force that each sphere exerts on the other simply by 
substituting m1 = m2 = 5000 and R = 1 into equation (3.6). This gives F = 0.00167 N 
approximately, the weight of a few grains of salt! Such forces seem insignificant, but 
gravitation is the force that keeps the Moon in orbit around the Earth, and the Earth in 
orbit around the Sun. The reason for this disparity is that the masses involved are so much 
larger than those of the lead spheres in our example. For instance, the mass of the Sun is 
about 2 x 10°° kg. 


3.7. GRAVITATION BY A DISTRIBUTION OF MASS 


It is important to be able to calculate the gravitational force exerted on a particle 
by a distribution of mass, such as a disc or sphere. The Earth, for example, is an approxi- 
mately spherical mass distribution. We first treat an introductory problem of gravitational 
attraction by a pair of particles and then progress to continuous distributions of matter. 
In all cases, the law of multiple interactions means that the effective force exerted on a 
particle is the resultant of the individual forces of interaction exerted on that particle. 


Example 3.4 Attraction by a pair of particles 


A particle C, of mass m, and two particles A and B, each of mass M, are placed as 
shown in Figure 3.3. Find the gravitational force exerted on the particle C. 


Solution 
By the law of gravitation, each of the particles A and B attracts C with a force of 
magnitude F’ where 
mMG 
R2 


F’= 


) 


* Tt was Newton however who proved that Kepler’s laws of planetary motion follow from the inverse square 
law. 
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FIGURE 3.3 The particle C, of mass m, is 
attracted by the particles A and B, each of 
mass M. The resultant force on C points 
towards O. 


where R = (a* + x7)!/? is the distance AC (= BC). By symmetry, the resultant 
force F points in the direction CO and so its magnitude F can be found by summing 
the components of the contributing forces in this direction. Hence 


pa WMGoose _, MG Rcosa —omMG x 
= R2 ee a ed (age 


for x > 0. [The angle a is shown in Figure 3.3.] Thus the resultant force exerted on 
C looks nothing like the force exerted by a single gravitating particle. In particular, it 
is not equal to the force that would be exerted by a mass 2M placed at O. However, 
on writing F in the form 


2mMG a2\ 3/2 
F — 1 + —_ > 


we see that 


fox — 
x 


when x/a is large. Thus, when C is very distant from A and B, the gravitational 
force exerted on C is approximately the same as that of a single particle of mass 2M 
situated at O. 

The graph of the exact value of F as a function of x is shown in Figure 3.4. 
Dimensionless variables are used. F = 0 when x = O, and rises to a maximum when 
x = a//2 where F = 4mMG/3/3a*. Thereafter, F decreases, becoming ever 
closer to its asymptotic form m(2M)G /x?. ll 


General asymptotic form of F as 1 — co 


The asymptotic result in the last example is true for attraction by any bounded” distribution 
of mass. The general result can be stated as follows: 


* This excludes mass distributions that extend to infinity, such as an infinite straight wire. 
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FIGURE 3.4 The dimensionless resultant force 


(mMG/a?)~'F plotted against x/a. 


Let S be any bounded system of masses with total mass M. Then the force F exerted 
by S ona particle P, of mass m and position vector* r, has the asymptotic form 


mMG 
r2 Is 


asr — co, wherer = |r| andf =r/r. 

In other words, the force exerted by S on a distant particle is approximately the same 
as that exerted by a particle of mass equal to the total mass of S, situated at the centre of 
mass of S. 


Example 3.5 Gravitation by a uniform rod 


A particle P, of mass m, and a uniform rod, of length 2a and mass M, are placed as 
shown in Figure 3.5. Find the gravitational force that the rod exerts on the particle. 


Solution 


Consider the element [x, x + dx] of the rod which has mass M dx /2a and exerts 
an attractive force of magnitude 


m(M dx/2a)G 
R2 


on P, where R is the distance shown in Figure 3.5. By symmetry, the resultant force 
acts towards the centre O of the rod and can be found by summing the components of 
the contributing forces in the direction PO. Since the rod is a continuous distribution 


* The result, as stated, is true for any choice of the origin O of position vectors. However, the asymptotic 
error is least if O is located at the centre of mass of S. In this case the relative error is of order (a/ ry, 
where a is the maximum ‘radius’ of the mass distribution about O. 
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FIGURE 3.5 A particle P, of mass m, is 
attracted by a uniform rod of length 2a and 
mass M. The resultant force F on P points 
towards the centre O of the rod. 


of mass, this sum becomes an integral. The resultant force exerted by the rod thus has 
magnitude F' given by 


pa mMe a C088 9 MUG _ 
2a q RR? 2a J_q R3 
mMG f[% bdx 


2a Jeg (x2 + b2)3/2’ 


where b is the distance of P from the centre of the rod. This integral can be evaluated 
by making the substitution x = b tan 6, the limits on 0 being 6 = +8, where tan B = 
a/b. This gives 


mMG /(2sinB mMG 2a 
F => = 
2a b 2a b(b2 + a?)!/2 


_ mMG 
~ b(b2 + a2) 1/2" a 


Example 3.6 Gravitation by a uniform disk 


A particle P, of mass m, is situated on the axis of a uniform disk, of mass M and 
radius a, as shown in Figure 3.6. Find the gravitational force that the disk exerts on 
the particle. 


Solution 


Consider the element of area dA of the disk which has mass M dA/ma? and 
attracts P with a force of magnitude 


m(M dA/ma?)G 
R2 


’ 


where R is the distance shown in Figure 3.6. By symmetry, the resultant force acts 
towards the centre O of the disk and can be found by summing the components of the 
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FIGURE 3.6 Left: A particle P, of mass m, is attracted by a uniform disk of mass M and 
radius a. The resultant force F on P points towards the centre O of the disk. Right: The 
element of area dA in polar cordinares r, 0. 


contributing forces in the direction P O. The resultant force exerted by the disk thus 
has magnitude F given by 


where the integral is to be taken over the region A occupied by the disk. This integral 
is most easily evaluated using polar coordinates. In this case dA = (dr)(r dé) = 
r dr d@, and the integrand becomes 


cosa  Rcosa _ b 
R2 RP | (r2 + b2)3/2’ 


where b is the distance of P from the centre of the disk. The ranges of integration for 
r,0areQ0<r<aand0O <6 < 27. We thus obtain 


mMG r= 6=2n 
F= a2 Z [. (= aay) rar do. 


Since the integrand is independent of 0, the 0-integration is trivial leaving 


r mMG i 2abrdr  _ 2mMG b(r2 + by ay 


2 


wa r<o0 (2 +b2)3/2 a? =0 
_ 2mMG b 
_ a (a2 + p?)1/2 


Gravitation by spheres 


Because of its applications to astronomy and space travel, and because we live on a nearly 
spherical body, gravitation by a spherical mass distribution is easily the most important 
case. We suppose that the mass distribution occupies a spherical volume and is also spher- 
ically symmetric so that the mass density depends only on distance from the centre of the 
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FIGURE 3.7 Left: A particle P, of mass m, is attracted by a symmetric sphere of radius a and total 
mass M. Right: The element of volume dV in spherical polar coordinates r, 0, @. 


sphere. We call such a body a symmetric sphere. The fact that we do not require the 
density to be uniform is very important in practical applications. The Earth, for instance, 
has a density of about 3,000 kgm~? near its surface, but its density at the centre is about 
16, 500kgm~*. Similar remarks apply to the Sun. Thus, if our results were restricted to 
spheres of uniform density, they would not apply to the Earth or the Sun, the two most 
important cases. 

The fundamental result concerning gravitation by a symmetric sphere was proved by 
Newton himself and confirmed his universal theory of gravitation. It is presented here as 
a theorem. 


Theorem 3.1 The gravitational force exerted by a symmetric sphere of mass M on a 
particle external to itself is exactly the same as if the sphere were replaced by a particle 
of mass M located at the centre. 


Proof. Figure 3.7 shows a symmetric sphere with centre O and radius a, and a particle P, 
of mass m, exterior to the sphere. We wish to calculate the force exerted by the sphere on the 
particle. The calculation is similar to that in the ‘disk’ example, but this time the integration 
must be carried out over the spherical volume occupied by the mass distribution. 

Consider the element of volume dv of the sphere which has mass p dv and attracts P with 
a force of magnitude 


m(pdv)G 
R2 , 


where R is the distance shown in Figure 3.7. By symmetry, the resultant force acts towards 
the centre O of the sphere and can be found by summing the components of the contributing 
forces in the direction PO. The resultant force exerted by the sphere thus has magnitude F 
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given by 


p cosa 
F=mG dv, 
y 


where the integral is to be taken over the region VY occupied by the sphere. This inte- 
gral is most easily evaluated using spherical polar coordinates r, 6, @. In this case dv = 
(dr)(r d6)(r sin@ dd) = r? sin@ dr dé dd, and the integrand becomes 


PCosa _ pRcosa p(r) (b —r cos@) 


R2 R3 (rr? +b? — 2rbcos6)3/2’ 


on using the cosine rule R* = r* + b* — 2rbcos@, where b is the distance of P from the 
centre of the sphere. The ranges of integration forr, 0, @ areO <r < a,0 < 6 < m and 
0 < @ < 2z. We thus obtain 


r=a p0=n ph=20 — 
p(r) (b —rcosé) 2. 
F =mG ‘ 6dr dédd¢. 
m I, ie ie (eae) eee 


This time the ¢-integration is trivial leaving 


r=a 6=n i = 
F= mG | / ENO TEOSE) r sin@ dr dO 
-0 Jo=0 ¢ 
=a 


r2 + b2 — 2rbcos@)3/2 


r= =n : 
b—rcosé 6 dé 
= 2xmé | r7p(r) / ( ae dr, 
a e-0 (r2 +b? — 2rbcos6)3/2 


on taking the 6-integration first and the r-integration second. 
The 6-integration is tricky if done directly, but it comes out nicely on making the change 
of variable from @ to R given by 


R? =r? +. b* — 2rbcos8, (R > 0). 


(In this change of variable, 7 has the status of a constant.) The range of integration for R is 
b—r<R<b4+r. Then 


2RdR = 2rbsiné dé, 


2b? — 2rbcos@ _ R* + (b? —r?) 


b—rcosé = , 
2b 2b 


and the 0-integral becomes 


b+r R2 ate (b2 _ r2) RdR 1 b+r b2 _ r2 2 
= 1+ dR=-—, 
- 2bR3 rb. Orb? Jy R2 be 


on performing the now elementary integration. 


Hence 
G r=a 
F= ~ (4x i r-p(r) ar) 


and this is as far as we can go without knowing the density function p(7). The answer that we 
are looking for is that F = mMG/b*, where M is the total mass of the sphere. Now M can 
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also be calculated as a volume integral. Since the mass of the volume element dv is p dv, the 
total mass M is given by 


r=a 6=1 g=20 
M= / pav= | / / r7p(r) sind dr d6 d¢ 
v r=0 Jo=0 J¢=0 
=da 


" 
= ax [ rp(r) dr, 
T 


-—() 


on performing the 0- and ¢-integrations. 
Hence, we finally obtain 


which is the required result. 

Since there is no reason why the density p(r) should not be zero over part of its range, this 
result also applies to the case of a particle external to a hollow sphere. The case of a particle 
inside a hollow sphere is different (see Problem 3.5). Mf 


Spheres attracted by other spheres 


Since any element of mass is attracted by a symmetric sphere as if the sphere were a parti- 
cle, it follows that the force that a symmetric sphere exerts on any other mass distribution 
can be calculated by replacing the sphere by a particle of equal mass located at its centre. 
In particular then, the force that two symmetric spheres exert upon each other is the same 
as if each sphere were replaced by its equivalent particle. Thus, as far as the forces of 
gravitational attraction are concerned, symmetric spheres behave exactly as if they were 
particles. 


3.8 THE PRINCIPLE OF EQUIVALENCE AND g 


Although we have so far not mentioned it, the law of gravitation hides a deep and 
very surprising fact, namely, that the force between gravitating particles is proportional 
to each of their inertial masses. Now inertial mass, as defined by equation (3.2), has no 
necessary connection with gravitation. It is a measure of the reluctance of that particle 
to being accelerated and can be determined by non-gravitational means, for instance, by 
using electrostatic interactions between the particles. It is a matter of extreme surprise 
then that a quantity that seems to have no necessary connection with gravitation actually 
determines the force of gravitation between particles. What we would have expected was 
that each particle would have a second property m*, called gravitational mass (not the 
same as m), which appears in the law of gravitation (3.6) and determines the gravitational 
force. For example, suppose that we have three uniform spheres of gold, silver and bronze 
and that the silver and bronze spheres have equal inertial mass. Then the law of gravitation 
states that, when separated by equal distances, the gold sphere will attract the silver and 
bronze spheres with equal forces, whereas we would have expected these forces to be 
different. 
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FIGURE 3.8 A particle of mass M is attracted m; cd 
by the gravitation of the system S which M e, ene e 
consists of N particles with masses ee aad Yr. e e 
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The question arises then as to whether m and m* are actually equal or just nearly 
equal so that the difference is difficult to detect. Newton himself did experiments with 
pendulums made of differing materials, but could not detect any difference in the period. 
Newton’s experiment could have detected a difference of about one part in 10°. However, 
the classic experiment of E6tvés (1890) and its later refinements have now shown that any 
difference between m and m” is less than one part in 10!!. This leads us to believe that m 
and m* really are equal and that the law of gravitation means exactly what it says. 

The proposition that inertial and gravitational mass are exactly equal is called the 
principle of equivalence. Although we accept the principle of equivalence as being true, 
we still have no explanation why this is so! In this context, it is worth remarking that 
Einstein made the principle of equivalence into a fundamendal assumption of the theory 
of general relativity. 


The gravitational acceleration g 


Suppose a particle P of mass M is under the gravitational attraction of the system S, as 
shown in Figure 3.8. Then, by the law of gravitation, the resultant force F that S exerts 
upon P is given by 


N 
MmG Mm2G MmyG iG 
F= m eyt+ = epg MONE oy ou () uke «) 


is independent of M. Then, by the Second Law, the induced acceleration a of particle P 
is determined by the equation 


Mg=Ma, 


that is, 
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Thus the induced acceleration g is the same for any particle situated at that point. This 
rather remarkable fact is a direct consequence of the principle of equivalence. Tradition 
has it that, prior to Newton, Galileo did experiments in which he released different masses 
from the top of the Tower of Pisa and found that they reached the ground at the same 
time. Galileo’s result is thus a colourful but rather inaccurate verification of the principle 
of equivalence! 


Gravitation by the Earth (rotation neglected) 


In the present treatment, the rotation of the Earth is neglected and we regard the Earth as 
an inertial frame of reference. A more accurate treatment which takes the Earth’s rotation 
into account is given in Chapter 17. 

When the system S is the Earth (or some other celestial body) it is convenient to 
introduce the notion of the local vertical direction. The unit vector k, which has the 
opposite direction to g, is called the vertically upwards unit vector relative to the Earth. 
In terms of k, the force exerted by the Earth on a particle of mass M is given by 


F = —Mgk, 


where the gravitational acceleration g is the magnitude of the gravitational acceleration 
vector g. Both g and k are functions of position on the Earth. 


Weight 


The positive quantity Mg (which is a function of position) is called the weight of the 
particle P. It is the magnitude of the gravity force exerted on P by the Earth. Thus the 
same body will have different weights depending upon where it is situated. However, at 
a fixed point of space, the weights of bodies are proportional to their masses. This fact, 
which is a consequence of the principle of equivalence, enables masses to be compared 
simply by comparing their weights at the same location (by using a balance, for instance). 


The approximation of uniform gravity 


It is easy to see that the Earth’s gravitational acceleration g and the vertical direction k 
depend upon position. The Earth is approximately a symmetric sphere which exerts its 
gravitational force as if all its mass were at its centre. Thus, if the value of g at a point on 
the Earth’s surface is g;, then the value of g at a height of 6,400 km (the Earth’s radius) 
must be g;/4 approximately. On the other hand, the vertical vector k changes from point 
to point on the Earth’s surface. These changes will be significant for motions whose 
extent is significant compared to the Earth’s radius; this is true for a ballistic missile, for 
instance. However, most motions taking place on Earth have an extent that is insignificant 
compared to the Earth’s radius and for which the variations of g and k are negligible. 
Simple examples include the motion of a tennis ball, a javelin or a bullet. 

The approximation in which g and k are assumed to be constants is called uniform 
gravity. Uniform gravity is the most common force field in mechanics. Many of the 
problems solved in this book make this simplifying (and accurate) approximation. 
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FIGURE 3.9 An elevator contains a ball B 
and both are freely falling under uniform 
gravity. F is an inertial reference frame and 
F’ is a reference frame attached to the falling 
elevator. 
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Numerical values of ¢ 


The value of g at any location on the Earth can be measured experimentally (by using a 


pendulum for instance). The value of g is not quite constant over the Earth’s surface since 


the Earth does not quite have spherical symmetry and different locations have differing 


altitudes. At sea level on Earth, g = 9.8 ms~? approximately, and a rough value of 


10 ms~? is often assumed. The corresponding value for the Moon is 1.6 ms~”, roughly 
a sixth of the Earth’s value. 


Example 3.7 Particle inside a falling elevator 


An elevator cable has snapped and the elevator and its contents are falling under 
uniform gravity. One of the passengers takes a ball from his pocket and throws it to 
another passenger.” What is the motion of the ball relative to the elevator? 


Solution 


Suppose that the ball has mass m and that the local (vector) gravitational acceleration 
is —gk. Then the motion of the ball relative to an inertial reference frame F (fixed to 
the ground, say) is determined by the Second Law, namely, 


ma = —mgk, 


where a is the acceleration of the ball measured in F. 
Let the frame F’ be attached to the elevator, as shown in Figure 3.9. Then the 
acceleration a’ of the ball measured in F’ is given (see section 2.6) by 


a=a+A, 


where A is the acceleration of the frame F’ relative to F. But the elevator, to which 
the frame F’ is attached, is also moving under uniform gravity and its acceleration A 
is therefore, by the principle of equivalence, the same as that of the ball, namely, 


A=-—gk. 


* People do react oddly when put under pressure. 
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Hence a = a’ — gk and so 
a’ =0. 


Thus, relative to the elevator, the ball moves with constant velocity. To observers 
resident in the frame F’, gravity appears to be absent and ¥’ appears to be an inertial 
frame. This provides a practical method for simulating conditions of weightlessness. 
Fortunately for those wishing to experience weightlessness, there is no need to use an 
elevator; the same acceleration can be achieved by an aircraft in a vertical dive! 

This result is of considerable importance in the theory of general relativity. It 
shows that, locally at least, a gravitational field can be ‘transformed away’ by observ- 
ing the motion of bodies from a freely falling reference frame. HH 


Problems on Chapter 3 


Answers and comments are at the end of the book. 


Harder problems carry a star (+). 


Gravitation 


3.1 Four particles, each of mass m, are situated at the vertices of a regular tetrahedron of side 
a. Find the gravitational force exerted on any one of the particles by the other three. 

Three uniform rigid spheres of mass M and radius a are placed on a horizontal table and 
are pressed together so that their centres are at the vertices of an equilateral triangle. A fourth 
uniform rigid sphere of mass M and radius a is placed on top of the other three so that all four 
spheres are in contact with each other. Find the gravitational force exerted on the upper sphere 
by the three lower ones. 


3.2 Eight particles, each of mass m, are situated at the corners of a cube of side a. Find the 
gravitational force exerted on any one of the particles by the other seven. 

Deduce the total gravitational force exerted on the four particles lying on one face of the 
cube by the four particles lying on the opposite face. 


3.3 A uniform rod of mass M and length 2a lies along the interval [—a, a] of the x-axis and 
a particle of mass m is situated at the point x = x’. Find the gravitational force exerted by the 
rod on the particle. 

Two uniform rods, each of mass M and length 2a, lie along the intervals [—a, a] and 
[b — a,b + a] of the x-axis, so that their centres are a distance b apart (b > 2a). Find the 
gravitational forces that the rods exert upon each other. 


3.4 A uniform rigid disk has mass M and radius a, and a uniform rigid rod has mass M’ and 
length 6. The rod is placed along the axis of symmetry of the disk with one end in contact 
with the disk. Find the forces necessary to pull the disk and rod apart. [Hint. Make use of the 
solution in the ‘disk’ example.] 


3.5 Show that the gravitational force exerted on a particle inside a hollow symmetric sphere 
is zero. [Hint. The proof is the same as for a particle outside a symmetric sphere, except in 
one detail.] 
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3.6 A narrow hole is drilled through the centre of a uniform sphere of mass M and radius 
a. Find the gravitational force exerted on a particle of mass m which is inside the hole at a 
distance r from the centre. 


3.7 A symmetric sphere, of radius a and mass M, has its centre a distance b (b > a) from an 
infinite plane containing a uniform distribution of mass o per unit area. Find the gravitational 
force exerted on the sphere. 


3.8% Two uniform rigid hemispheres, each of mass M and radius a are placed in contact with 
each other so as to form a complete sphere. Find the forces necessary to pull the hemispheres 
apart. 


Computer assisted problem 


3.9 A uniform wire of mass M has the form of a circle of radius a and a particle of mass 
m lies in the plane of the wire at a distance b (b < a) from the centre O. Show that the 
gravitational force exerted by the wire on the particle (in the direction O P) is given by 


__ mMG [ (cos 6 — &)dé 
~ Ia Jo {1+ &2 — 2& cos6}3/2’ 


where the dimensionless distance € = b/a. 

Use computer assistance to plot the graph of (dimensionless) F against € for 0 < € < 0.8 
and confirm that F is positive for € > 0. Is the position of equilibrium at the centre of the 
circle stable? Could the rings of Saturn be solid? 


Chapter Four 
a | 


Problems in particle dynamics 


KEY FEATURES 

The key features in this chapter are (i) the vector equation of motion and its reduction to 
scalar equations, (ii) motion in a force field, (iii) geometrical constraints and forces of con- 
straint, and (iv) linear and quadratic resistance forces. 


Particle dynamics is concerned with the problem of calculating the motion of a particle 
that is acted upon by specified forces. Our starting point is Newton’s laws. However, since 
the First Law merely tells us that we should observe the motion from an inertial frame, 
and the Third Law will never be used (since there is only one particle), the entirety of 
particle dynamics is based on the Second Law 


ma=F\,+Fo+::-+Fy, 


where F, F2,..., Fy are the various forces that are acting on the particle. The typical 
method of solution is to write the Second Law in the form 


dv 
i fore a EN, (4.1) 


which is a first order ODE for the unknown velocity function v(t) and is called the equa- 
tion of motion of the particle. If the initial value of v is given, then equation (4.1) can 
often be solved to yield v as a function of the time f. Once v is determined (and if the 
initial position of the particle is given), the position vector r of the particle at time ¢ can 
be found by solving the first order ODE dr/dt = v. The sections that follow contain 
many examples of the implementation of this method. Indeed, it is remarkable how many 
interesting problems can be solved in this way. 


Question When can real bodies be modelled as particles? 

Newton’s laws apply to particles, but real bodies are not particles. When can real 
bodies, such as a tennis ball, a spacecraft, or the Earth, be treated as if they were 
particles? 
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After time ¢ 


—-> Cc 
> — 
+ 


FIGURE 4.1 The particle is initially at the 
origin and is projected vertically upwards 
with speed u. The particle moves in a vertical 
straight line (the axis Oz) under the uniform 
gravity force mg and possibly a resistance (or uU 

drag) force D. At time ¢ the particle has Initi ally O 
upward velocity v. { 


Answer 

This is quite a tricky question which is not fully discussed until Chapter 10. What we 
will show is that the centre of mass of any body moves as if it were a particle of mass 
equal to the total mass, and all the forces on the body acted upon it. In particular, 
a rigid body, moving without rotation, can be treated exactly as if it were a particle. 
For example, a block sliding without rotation on a table can be treated exactly as if it 
were a particle. In other cases we gain only partial information about the motion. If 
the body is a brick thrown through the air, then particle dynamics can tell us exactly 
where its centre of mass will go, but not which point of the brick will hit the ground 
first. 


4.1. RECTILINEAR MOTION IN A FORCE FIELD 


Our first group of problems is concerned with the straight line motion of a particle 
moving in a force field. A force F is said to be a field if it depends only on the position of 
the particle, and not, for instance, on its velocity or the time. For example, the gravitational 
attraction of any fixed mass distribution is a field of force, but resistance forces, which are 
usually velocity dependent, are not. 

If the rectlinear motion takes place along the z-axis, the equation of motion (4.1) 
reduces to the scalar equation 


oe Bie 4.2) 
era (z), (4. 


where v is the (one-dimensional) velocity of the particle and F (z) is the (one-dimensional) 
force field, both measured in the positive z-direction. 

First we consider the problem of vertical motion of a particle under uniform gravity 
with no air resistance. This is fine on the Moon (which has no atmosphere) but, on 
Earth, the motion of a body is resisted by its passage through the atmosphere and this will 
introduce errors. The effect of resistance forces is investigated in section 4.3. 


Example 4.1 Vertical motion under uniform gravity 
A particle is projected vertically upwards with speed u and moves in a vertical straight 


line under uniform gravity with no air resistance. Find the maximum height achieved 
by the particle and the time taken for it to return to its starting point. 
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Solution 


Let v be the upwards velocity of the particle after time ¢, as shown in Figure 4.1. Then 
the scalar equation of motion (4.2) takes the form 


dv 
m—=-—mg, 
dt 8 


since the drag force D is absent. A simple integration gives 
v=—gt+C, 


where C is the integration constant, and, on applying the initial condition v = u when 
t = 0, we obtain C = u. Hence the velocity v at time ¢ is given by 


v=u-— gt. 
To find the upward displacement z at time f, write 


dz P 
ee — oe P 
dt ? 


A second simple integration gives 
Ted 
z=ut—5gt'+D, 
where D a second integration constant, and, on applying the initial condition z = 0 
when t = 0, we obtain D = 0. Hence the upward displacement of the body at time ¢ 
is given by 


zZ=ut — 5gt. 


The maximum height z,,,x is achieved when dz/dt = 0, that is, when v = 0. 
Thus Zmax iS achieved when t = u/g and is given by 


u 1 u uz 
Zz =u z = : 
max @ 38 g 2¢ 


The particle returns to O when z = 0, that is, when 
t (u as st) =0. 
2 


Thus the particle returns after a time 2u/g. 

For example, if we throw a body vertically upwards with speed 10 ms~!, it will 
rise to a height of 5 m and return after 2s. [Here we are neglecting atmospheric 
resistance and taking g = 10 ms~?.] 
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Question Saving oneself in a falling elevator 

An elevator cable has snapped and the elevator is heading for the ground. Can the 
occupants save themselves by leaping into the air just before impact in order to avoid 
the crash? 


Answer 

Suppose that the elevator is at rest at a height H when the cable snaps. The elevator 
will fall and reach the ground with speed (2gH)!/. In order to save themselves, the 
occupants must leap upwards (relative to the elevator) with this same speed so that 
their speed relative to the ground is zero. If they were able do this, then they would 
indeed be saved. However, if they were able to project themselves upwards with this 
speed, they would also be able to stand outside the building and leap up to the same 
height H that the elevator fell from! Even athletes cannot jump much more than a 
metre off the ground, so the answer is that escape is possible in principle but not in 
practice. Hl 


Uniform gravity is the simplest force field because it is constant. In the next example 


we show how to handle a non-constant force field. 


Example 4.2 Rectilinear motion in the inverse square field 


A particle P of mass m moves under the gravitational attraction of a mass M fixed at 
the origin O. Initially P is at a distance a from O when it is projected with speed u 
directly away from O. Find the condition that P will ‘escape’ to infinity. 


Solution 
By symmetry, the motion of P takes place in a straight line through O. By the law of 
gravitation, the scalar equation of motion is 


dv mMG 
m— =— ‘ 
dt r2 
where r is the distance OP and v = 7. Equations like this can always be integrated 
once by first eliminating the time. Since 
dv dv_ dr du 
= x = ; 
dt dr dt dr 
the equation of motion can be written as 
du MG 
vw ==: 
dr r2 
a first order ODE for v as a function of r. This is to be solved with the initial condition 
v =u whenr = a. The equation separates to give 


[vav=-me [ S. 
r2 


and so 
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where C is the integration constant. On applying the initial condition v = u when 
r =a, we find that C = (u*/2) — (MG/a) so that 


v= 


2 ( > 2M “) 2MG 
Uu + é 
- 
This determines the outward velocity v as a function of r. 


Whether the particle escapes to infinity, or not, depends on the sign of the brack- 
eted constant term. 


(i) Suppose first that this term is positive so that 


2MG 
u2 => 


a? 
a 


where V is a positive constant. Then, since the term 2MG/r is positive, it follows 
that v > V at all times. It further follows thatr > a+ Vt for all ¢ and so the particle 
escapes to infinity. 


(ii) On the other hand, if u2 — (2MG /a) is negative, then v becomes zero when 
a 
r= —._—_., 
1 — (u2a/2MG) 
after which the particle falls back towards O and does not escape. 


(iii) The critical case, in which u? = 2MG /a is treated in Problem 4.10; the result is 
that the particle escapes. 


Hence the particle escapes if (and only if) 


> 2MG 
> . 


u 


a 


Question Given u, findrmagx and the time taken to get there 


For the particular case in which u2 = MG/a, find the maximum distance from O 
achieved by P and the time taken to reach this position. 


Answer 


For this value of u, the equation connecting v andr becomes 


vi =MGi-——'). 
roa 


Since 7 = /max when v = 0, it follows that the maximum distance from O achieved 
by P is 2a. 
To find the time taken, we write v = dr/dt and solve the ODE 


(7) -“o(-2) 
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with the initial condition; = a when t = 0. After taking the positive square root of 
each side (dr/dt > 0 in this motion), the equation separates to give 


2a ar 1/2 T 
/ ( ) ar =(may'? [ dt. 
_ 2a—r 0 


(Here we have introduced the initial and final conditions directly into the limits of 
integration; tT is the elapsed time.) On simplifying, we obtain 


2a ar 1/2 
r= May? | (“_) dr. 


This integral can be evaluated by making the substitution r = 2a sin? 6; the details 
are unimportant. The result is that the time taken for P to progress from r = a to 
r = 2a is 


=(i6) (4 )-= 


Question Speed of escape from the Moon 


A body is projected vertically upwards from the surface of the Moon. What projection 
speed is neccessary for the body to escape the Moon? 


Answer 

We regard the Moon as a fixed symmetric sphere of mass M and radius R. In this case, 
the gravitational force exerted by the Moon is the same as that of a particle of mass M 
situated at the centre. Thus the preceding theory applies with the distance a replaced 
by the radius R. The escape speed is therefore (2MG/R)!/?, which evaluates to 
about 2.4 kms~!. [For the Moon, M = 7.35 x 10°? kg and R = 1740 km.] m 


CONSTRAINED RECTILINEAR MOTION 


Figure 4.2 shows a uniform rigid rectangular block of mass M sliding down the 


inclined surface of a fixed rigid wedge of angle a. The initial conditions are supposed 


to be such that the block slides, without rotation, down the line of steepest slope of the 


wedge. The block is subject to uniform gravity, but it is clear that there must be other 


forces as well. If there were no other forces and the block were released from rest, then 


the block would move vertically downwards. However, solid bodies cannot pass through 


each other like ghosts, and interpenetration is prevented by (equal and opposite) forces 


that they exert upon each other. These are material contact forces which come into play 


only when bodies are in physical contact. They are examples of forces of constraint, 


which are not prescribed beforehand but are sufficient to enforce a specified geometrical 


constraint. Tradition has it that the constraint force that the wedge exerts on the block is 
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FIGURE 4.2 A rigid rectangluar block slides down the 
inclined surface of a fixed rigid wedge of angle a. Note 
that k" is the vertically upwards unit vector, while i and k 
are parallel to and perpendicular to the inclined surface of 
the wedge. 


called the total reaction force R. It is convenient to write this force in the form 
R=—Fi+Nk, 


where the unit vectors i and k are parallel to and normal to the slope of the wedge. The 
scalar N is called the normal reaction component and the scalar F is called the frictional 
component.* 

The equation of motion of the block is the vector equation (4.1) which becomes 


ais 
M oP mgkY — Fi+Nk, 


where k' is the vertically upwards unit vector. The easiest way of proceeding is to take 
components of this vector equation in the i- and k-directions (the j-component gives 


nothing). On noting that ky = — sinwi + cosak, this gives 
M— = mg sina ~ F and 0 = N —megcosa. 


The second of these equations determines the normal reaction N = mg cosa. However, 
in the first equation, both v and F are unknown and this prevents any further progress 
in the solution of this problem.* One can proceed by proposing some empirical ‘law of 
friction’, but such laws hold only very roughly. It is not surprising then that, in much of 
mechanics, frictional forces are neglected. In this case, the total reaction force exerted by 
the surface is in the normal direction and we describe such surfaces as smooth, meaning 


* The minus sign is introduced so that F will be positive when the scalar velocity v is positive. 
* This reflects the fact that we have said nothing about the roughness of the surface of the wedge! 
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a T 


FIGURE 4.3 The idealised string is depicted 
here as having a small circular cross-section. 
At each cross-section only tensile stresses 
exist and their resultant is the tension T in the 
string at that point. 


‘perfectly smooth’. Doing away with friction has the advantage of giving us a well-posed 
problem that we can solve; however the solution will then apply only approximately to 
real surfaces. 

If we now suppose that the inclined surface of the wedge is smooth, then F = 0 and 
the first equation reduces to 


dv ; 
—_—- = sin a. 
dt g 


Thus, in the absence of friction, the block slides down the plane with the constant accel- 
eration g sina. 


Inextensible strings 


Another agency that can cause a geometrical constraint is the inextensible string. If a 
particle P of a system is connected to a fixed point O by an inextensible string of length 
a then, if the string is taut, P is constrained to move so that the distance OP = a. This 
geometrical constraint is enforced by the (unknown) constraint force that the string applies 
to particle P. Our ‘string’ is an idealisation of real cords and ropes in that it is infinitely 
thin, has no bending stiffness, and is inextensible. The only force that one part of the string 
exerts on another is the tension 7 in the string, which acts parallel to the tangent vector t¢ 
to the string at each point (see Figure 4.3). 

It is evident that, in general, T varies from point to point along the string. Suppose 
for example that a uniform string of mass p per unit length is suspended vertically under 
uniform gravity. Then, since the tension at the lower end is zero, the string will not be in 
equilibrium unless the tension at a height z above the lowest point is given by T = pgz; 
the tension thus rises linearly with height. 

The situation is simpler when the mass of the string is negligible; this is the case of 
the light inextensible string.* In this case, it is obvious that the tension is constant when 
the string is straight. In fact, the tension also remains constant when the string slides over 
a smooth body. This is proved in Chapter 10. The tension in a light string is also constant 
when the string passes over a light, smoothly pivoted pulley wheel. 


* In this context, ‘light’ means ‘of zero mass’. 
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FIGURE 4.4 Attwood’s machine: two bodies 
of masses m and M are connected by a light 
inextensible string which passes over a 
smooth rail. 


Example 4.3 Attwood’s machine 


Two bodies with masses m, M are connected by a light inextensible string which 
passes over a smooth horizontal rail. The system moves in a vertical plane with the 
bodies moving in vertical straight lines. Find the upward acceleration of the mass m 
and the tension in the string. 


Solution 


The system is shown in Figure 4.4. Let v be the upward velocity of the mass m. Then, 
since the string is inextensible, v must also be the downward velocity of the mass M. 
Also, since the string is light and the rail is smooth, the string has constant tension T . 
The scalar equations of motion for the two masses are therefore 


dv r ue M T 
m— =T—me, — = —T. 
dt g dt g 


It follows that 


du M—m 2Mm 
—= g and f= g. 0 
dt M+m M+m 


Question The monkey puzzle 


Suppose that, in the last example, both bodies have the same mass M and one of them 
is a monkey which begins to climb up the rope. What happens to the other mass? 


Answer 

Suppose that the monkey climbs with velocity V relative to the rope. Then its upward 
velocity relative to the ground is V — v. The equations of upward motion for the mass 
and the monkey are therefore 


dv d(V —v) 
Mo =T — Mg, FY adc 


=T—-Meg. 
dt dt g 
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FIGURE 4.5 The drag D and lift ZL on a body moving through a 
fluid. 


On eliminating T, we find that 


v=4V, and V—v=35V. 


Thus the monkey and the mass rise (relative to the ground) with the same velocity; 
the monkey cannot avoid hauling up the mass as well as itself! mm 


4.3 MOTION THROUGH A RESISTING MEDIUM 
The physics of fluid drag 


When a body moves through a fluid such as air or water, the fluid exerts forces on the surface 
of the body. This is because the body must push the fluid out of the way, and to do this the body 
must exert forces on the fluid. By the Third Law, the fluid must then exert equal and opposite 
forces on the body. A person wading through water or riding a motorcycle is well aware of 
the existence of such forces, which fall into the general category of material contact forces. 
We are interested in the resultant force that the fluid exerts on the body and it is convenient to 
write this resultant in the form 


F=D+L, 


where the vector drag force D has the opposite direction to the velocity of the body, and the 
vector lift force L is at right angles to this velocity. The existence of lift makes air travel 
possible and is obviously very important. However, we will be concerned only with drag since 
we will restrict our attention to those cases in which the body is a rigid body of revolution 
which moves (without rotation) in the direction of its axis of symmetry. In this case, the lift is 
zero, by symmetry. We are then left with the scalar drag D, acting in the opposite direction to 
the velocity of the body. 

The theoretical determination of lift and drag forces is one of the great unsolved problems 
of hydrodynamics and most of the available data has been obtained by experiment. Even for 
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the case of a rigid sphere moving with constant velocity through an incompressible* fluid, a 
general theoretical solution for the drag is not available. In this problem, the drag depends 
on the radius a and speed V of the sphere, and the density p and viscosity jz of the fluid. 
Straightforward dimensional analysis shows that D must have the form 


V 
D = pa2V?F (“*) 
LL 


where F is a function of a single variable. 


Definition 4.1 Reynold’s number The dimensionless quantity R = pV.a/, is called the 
Reynolds number." It is more commonly written R = Va/v, where the quantity v = 1/p is 
called the kinematic viscosity of the fluid. 


The function F(R) has never been calculated theoretically, and experimental data must be 
used. It is a surprising fact that, for a wide range of values of R (about 1000 < R < 100, 000), 
the function F is found to be roughly constant. Subject to this approximation, the formula for 
the drag becomes 


D= Cpa’*V?, 


where the dimensionless constant C is called the drag coefficient* for the sphere; its value is 
about 0.8. 

A similar formula holds (with a different value of C) for any body of revolution moving 
parallel to its axis. In this case a is the radius of the maximum cross sectional area of the body 
perpendicular to the direction of motion. For example, the drag coefficient for a circular disk 
moving at right angles to its own plane is about 1.7. We thus obtain the result that (subject 
to the conditions mentioned above) the drag is proportional to the square of the speed of the 
body through the fluid. This is the quadratic law of resistance. 

This result does not hold for low Reynolds numbers. This was shown theoretically by 
Stokes* in his analysis of the creeping flow of a fluid past a sphere. Stokes proved that, as 
R — 0, the function F(R) ~ 67/R so that the drag formula becomes 


D ~ 6ranV. 


On dimensional grounds, a similar formula (with a different coefficient) should hold for other 
bodies of revolution. Thus at low Reynolds numbers’ the drag is proportional to speed of the 
body through the fluid. This is the linear law of resistance. 

Which (if either) of these laws is appropriate in any particular case depends on the 
Reynolds number. However, it is quickly apparent that the low Reynolds number condition 
requires quite special physical conditions, as the following example shows. 


* In this treatment, the effects of fluid compressibility are neglected. In practice, this means that the speed 
of the body must be well below the speed of sound in the fluid. 

¥ After the great English hydrodynamicist Osborne Reynolds 1842-1912. At the age of twenty six he was 
appointed to the University of Manchester’s first professorship of engineering. 

+ The drag coefficient Cp used by aerodynamicists is 2C /z. 

S$ George Gabriel Stokes 1819-1903, a major figure in British applied mathematics. 

{ Low means R less than about 0.5. 
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Table 1 Some fluid properties relevant to drag calculations (Kaye & Laby [14)]). 


Density p Kinematic viscosity v Sound speed 

(kgm~*) (m*s~') (ms~!) 
Air (20°C, 1 atm.) 1.20 1.50 x 10-> 343 
Water (20°C) 998 1.00 x 10~° 1480 
Castor oil (20°C) 950 1.04 x 1073 1420 


Example 4.4 Which law of resistance? 


A stainless steel ball bearing of radius 1 mm is falling vertically with constant speed in 
air. Find the speed of the ball bearing. [The density of stainless steel is 7800 kg mi. 
If the medium were castor oil, what then would be the speed of the ball bearing? 


Solution 

Suppose the ball bearing is falling with constant speed V. (We will later call V the 
terminal speed of the ball bearing.) Then, since its acceleration is zero, the total of 
the forces acting upon it must also be zero. Thus 


mg+D=n'g, 


where m’ is the mass of the ball bearing, m is the mass of the displaced fluid, and 
D is the drag. The term m'g is the gravity force acting downwards and the term mg 
is the (Archimedes) buoyancy force acting upwards.* (In air, the buoyancy force is 
negligible.) 

Hence, if the linear resistance law holds, then 


6rapvV = tna? (o' _ p) g, 


where a is the radius of the ball bearing, and p’, p are the densities of the ball bearing 
and air respectively. This gives 


/ 
va (4-1). 
Ov p 
On using the numerical values given in Table 4.3 we obtain V = 940 ms~! with the 
corresponding Reynolds number R = 63, 000. Quite apart from the fact that the cal- 
culated speed is nearly three times the speed of sound, this solution is disqualified on 
the grounds that the Reynolds number is 100,000 times too large for the low Reynolds 


number approximation to hold! 
On the other hand, if the quadratic law of resistance holds then 


* Tt is not entirely obvious that the total force exerted by the fluid on the sphere is the sum of the drag and 
buoyancy forces, but it is true for an incompressible fluid. 
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where C is the drag coefficient for a sphere which we will take to be 0.8. In this case 


ee 4rag (4 7 1). 
3C p 
This gives the value V = 19ms~! with the corresponding Reynolds number R = 
1250. This Reynolds number is nicely within the range in which the quadratic resis- 
tance law is applicable, and so provides a consistent solution. Thus the answer is that, 
in air, the ball bearing falls with a speed of 19 ms~!. 

When the medium is castor oil, a similar calculation shows that it is the linear 
resistance law which provides the consistent solution. The answer is that, in castor 
oil, the ball bearing falls with a speed of 1.5 cms~!, the Reynolds number being 
0.015. 

This example illustrates the conditions needed for low Reynolds number flow: 
slow motion of a small body through a sticky fluid. Perhaps the most celebrated 
application of the low Reynolds number drag formula is Millikan’s oil drop method 
of determining the electronic charge (see Problem 4.20 at the end of the chapter). m 


Example 4.5 Vertical motion under gravity with linear resistance 


A body is projected vertically upwards with speed u in a medium that exerts a drag 
force —mK v, where K is a positive constant and v is the velocity of the body.* Find 
the maximum height acheived by the body, the time taken to reach that height, and 
the terminal speed. 


Solution 


On including the linear resistance force, the scalar equation of motion becomes 


du 
m— =—mg —mKov, 
dt 


with the initial condition v = u whent = 0 (see Figure 4.1). This first order ODE for 


v separates in the form 
d 
| ; ~ / a 
ge+Kuv 


and, on integration, gives 


1 
— | Kv) =-t+C, 
XK n(g + Kv) + 


where C is the integration constant. On applying the initial condition v = u when 
t = 0, we obtainC = K~!In(g + Ku) and so 


1 g+Ku 
t= —In : 
K gt+Kuv 


* This is the vector drag force acting on the body; hence the minus sign. The coefficient is taken in the 
form mK for algebraic convenience. 
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This expression gives ¢ in terms of v, which is what we need for finding the time taken 
to reach the maximum height. The maximum height is achieved when v = 0 so that 
tT, the time taken to reach the maximum height, is given by 


The expression for ¢ in terms of v can be inverted to give 
payee = (1 — eK") 


for the upward velocity of the body at time r. 
The terminal speed of the body is the limit of |v| as t + oo. In this limit, the 
exponential terms tend to zero and 
ipa, 
K 
Thus, in contrast to motion with no resistance, the speed of the body does not increase 
without limit as it falls, but tends to the finite value g/K. Thus the terminal speed of 
the body is g/K. 

The terminal speed can also be deduced directly from the equation of motion. 
If the body is falling with the terminal speed, then dv/dt = O and the equation of 
motion implies that 0 = —mg — mK v. It follows that the (upward) terminal velocity 
is —g/K. 

The maximum height zmax can now be found by integrating the equation 
dz/dt = v and then putting t = t. However we can also obtain Zmax by starting 
again with a modified equation of motion. For some laws of resistance, this trick is 
essential. If we write 


the equation of motion becomes 
v— =-g-—Kv, 
with the initial condition v = u when z = 0. This equation also separates to give 


udv 
form [pteen a! (yam) 
a ae 


In(g + Kv) + D, 


“Kk « re 
where D is the integration constant. On applying the initial condition v = u when 


z = 0, we obtain 


z=—E tas Fy In(g + Kv) + & — 5 In(g + Ku) 


1 g gt+Ku 
= —(u —v) In ; 
K K2 e+Kuv 


4.3. Motion through a resisting medium 87 


This expression cannot be inverted to give v as a function of z, but it is exactly what 
we need to find Zmax. Since Zmax 1S achieved when v = 0, we find that the maximum 
height acheived by the body is given by 


u g Ku 
2max = 3 — 53 In pee .i 


Question Approximate form of zmax for small Ku/g 


Find an approximate expression for zmax when Ku/g is small. 


Answer 
When Ku/g is small, the log term can be expanded as a power series. This gives 


u g | Ku Riu\* kay 
Zmax = 5 +4(—) +--- 
K K2 g 2 g 3 g 


“Ef-2)~-] 


In this expression, the leading term u7/2g is just the value of zmax in the absence 
of resistance. The first correction term has a negative sign which means that Zmax is 
reduced by the presence of resistance, as would be expected. ™ 


Question Ball bearing released in castor oil 


The ball bearing in Example 4.4 is released from rest in castor oil. How long does it 
take for the ball bearing to achieve 99% of its terminal speed? 


Answer 

Recall that the linear law of resistance is appropriate for this motion. Since the motion 
is entirely downwards, it is more convenient to measure v downwards in this problem, 
in which case the solution for v becomes 


v=Z(i-e*) =v (1-e#/”), 


where V is the terminal velocity. When v = 0.99V, e~8'/Y = 0.01 and so the time 
required is 


t = In(100)V/g, 


which evaluates to about 7 milliseconds on using the value for V calculated in Exam- 
ple 4.4. @ 


Note on the sign of resistance forces In the last example we used the same scalar equation of 
motion whether the body was rising or falling. This is correct in the case of linear resistance 
since, when the sign of v is reversed, so is the sign of K v. In the case of quadratic resistance 
however, when the sign of v is reversed, the sign of K v? remains unchanged and so the correct 
sign must be inserted manually. Thus, for guadratic resistance, the scalar equations of motion 
for ascent and descent are different. The same is true when the drag is proportional to any 
even power of v. 
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FIGURE 4.6 A particle, initially at the origin, 
is projected with speed wu in a direction 
making an angle @ with the horizontal. The 
particle moves under the uniform gravity 
force —mgk and the resistance (drag) 

force D. 


4.4 PROJECTILES 


A body that moves freely under uniform gravity, and possibly air resistance, is called 
a projectile. Projectile motion is very common. In ball games, the ball is a projectile, and 
controlling its trajectory is a large part of the skill of the game. On a larger scale, artillery 
shells are projectiles, but guided missiles, which have rocket propulsion, are not. 

The projectile problem differs from the problems considered in section 4.3 in that projec- 
tile motion is not restricted to take place in a vertical straight line. However, we will continue 
to assume that the effect of the air is to exert a drag force opposing the current velocity of 
the projectile.* It is then evident by symmetry that each projectile motion takes place in a 
vertical plane; this vertical plane contains the initial position of the projectile and is parallel 
to its initial velocity. 


Projectiles without resistance 


The first (and easiest) problem is that of a projectile moving without air resistance. This is fine 
on the Moon, but will be only an approximation to projectile motion on Earth. The effect of 
air resistance can be very significant, as our later examples will show. 


Example 4.6 Projectile without air resistance 


A particle which is subject solely to uniform gravity is projected with speeed u in a 
direction making an angle a with the horizontal. Find the subsequent motion. 


Solution 


Suppose that the motion takes place in the (x, z)-plane as shown in Figure 4.6. In the 
absence of the drag force, the vector equation of motion becomes 


with the initial condition v = (ucosa)i + (usina)k when t = 0. If we now write 
v = vyli + v-k and take components of this equation (and initial condition) in the 


* This will be true if the projectile is a rigid sphere moving without rotation. 
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i- and k-directions, we obtain the two scalar equations of motion 


dvx dv; 
= 0, —=-g, 
dt dt 
with the respective initial conditions v, = ucosa@ and v, = usina whent = 0. 


Simple integrations then give the components of the particle velocity to be 
Vy = UCOSA, v; =usina — gt. 


The position of the particle at time ¢ can now be found by integrating the expressions 
for v;, vz and applying the initial conditions x = 0 and z = 0 when? = 0. This gives 


x = (ucosa)t, z = (usina)t — }gr?, 


the solution for the trajectory of the particle. ™ 


Question Form of the path 
Show that the path taken by the particle is an inverted parabola. 


Answer 
To find the path, eliminate ¢ from the trajectory equations. This gives 
& 2 
zZ = (tana) x — (-)« A 
( ) 2u? cos? a 


which is indeed an inverted parabola. @ 


Question Time of flight and the range 
Find the time of flight and the range of the projectile on level ground. 


Answer 

On level ground, the motion will terminate when z = 0 again. From the second 
trajectory equation, this happens when (u sina) t — 5 gt? = 0. Hence the time of 
flight t is given by tT = 2usina/g. The horizontal range R is then obtained by 
putting ¢ = Tt in the first trajectory equation, which gives 


u sin 2a 


§ 


Question Maximum range 
Find the value of a that gives the maximum range on level ground when u is fixed. 


Answer 


R isa maximum when sin 2a = 1, that is when w = 7/4 in which case Rmax = u>/¢. 
Thus, if an artillery shell is to be projected over a horizontal range of 4 km, then the 
gun must have a muzzle speed of at least 200 ms~!. m 


There is a myriad of problems that can be found on the projectile with no air resistance, 
and some interesting examples are included at the end of the chapter. It should be noted 
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though that all these problems are dynamically equivalent to the problem solved above. Any 
difficulties lie in the geometry! 


Projectiles with resistance 


We now proceed to include the effect of air resistance. From our earlier discussion of fluid 
drag, it is evident that in most practical instances of projectile motion through the Earth’s 
atmosphere, it is the quadratic law of resistance that is appropriate. On the other hand, only 
the linear law of resistance gives rise to linear equations of motion and simple analytical 
solutions. This explains why mechanics textbooks contain extensive coverage of the linear 
case, even though this case is almost never appropriate in practice; the case that is appropriate 
cannot be solved! In the following example, we treat the linear resistance case. 


Example 4.7 Projectile with linear resistance 


A particle is subject to uniform gravity and the linear resistance force —mK v, where 
K is a positive constant and v is the velocity of the particle. Initially the particle is 
projected with speed u in a direction making an angle w with the horizontal. Find the 
subsequent motion. 


Solution 
With the linear resistance term included, the vector equation of motion becomes 


us K k 
m— = —mKv-—megk, 
dt = 
with the initial condition v = (ucosa)i + (usina)k when t = 0. As in the last 


example, this equation resolves into the two scalar equations of motion 


dvx dvz 
—+ Kv, = 0, —4+Kv,=-g, 
dt : dt - 
with the respective initial conditions vy. = ucosa@ and v, = usina whent = 0. 


These first order ODEs are both separable and linear and can be solved by either 
method; if they are regarded as linear, the integrating factor is eX’. The equations 
integrate to give the components of the particle velocity to be 


Vy = (u cosa)e **, vz = (u sina)e *! _ = (1 _ eK) : 


The position of the particle at time ¢ can now be found by integrating the expressions 
for vx, vz and applying the initial conditions x = 0 and z = 0 whent = 0. This gives 


x a Loose (ine), z= SUES ek) g " (4,3) 
K K2 K 


the solution for the trajectory of the particle. Figure 4.7 shows typical paths taken 
by the particle for the same initial conditions and three different values of the dimen- 
sionless resistance parameter 4 (= Ku/g). (The case A = O corresponds to zero 
resistance so that the path is a parabola.) It is apparent that resistance can have a 
dramatic effect on the motion. lH 
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(u2/g) 2 


0.4 


\=0 (parabola) 


—0.4 


FIGURE 4.7 Projectile motion under uniform gravity and linear 
resistance. The graphs show the paths of the particle for a = 2/3 and 
three different values of the dimensionless resistance parameter i 

(= Ku/g). Except when A = 0, the paths have vertical asymptotes. 


Question Vertical asymptote of the path 
Show that the path has a vertical asymptote. 


Answer 


Since e~*' decreases and tends to zero as t > 00, it follows from equations (4.3) that 
the horizontal displacement x increases and tends to the value u cosa/K ast —> oo, 
while the vertical displacement z tends to negative infinity. Thus the vertical line 
x = ucosa/K is an asymptote to the path. In terms of the dimensionless variables 
used in Figure 4.7, this is the line (u?/g)~!x = cosa/A. ll 


Question Approximate formula for the range when ) is small 


Find an approximate formula for the range on level ground when the resistance param- 
eter A is small. 


Answer 


Since the particle returns to Earth again when z = 0, it follows from the second of equations (4.3) 
that the flight time T satisfies the equation 


(Kusina + g) (1 - ge) —Kgt=0, 
which can be written in the form 
(sina + 1) (1-e7K") — Kr =0, (4.4) 
where A4(= Ku/g) is the dimensionless resistance parameter. Unfortunately, this equation cannot 


be solved explicity for t, and hence the need for an approximate solution. We know from the 
last example that, in the absence of resistance, the flight time t is given by tT = 2u sina/g. It is 
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R/Ro 


3-term approximation 


numerical solution 


2-term approximation 


A sin @ 


0.5 


FIGURE 4.8 The ratio R/Ro plotted against A sina. 


reasonable then, when A is small, to seek a solution for t in the form 
2u sina 


— [1+ bid + bod? +2 iF (4.5) 


é 


where the coefficients b;,b2,... are to be determined. To find the expansion coefficients we 
substitute the expansion (4.5) (truncated after the required number of terms) into the left side of 
equation (4.4), re-expand in powers of 4, and then set the coefficients in this expansion equal to 
zero. The corresponding formula for the range can then be found by substituting this approximate 
formula for t into the first equation of (4.3) and re-expanding in powers of A. The details are 
tedious and, in fact, such operations are best done with computer assistance. The completion 
of this solution is the subject of computer assisted Problem 4.34 at the end of this chapter. The 
answer (to three terms) is that the range R on level ground is given by 


R_,_(4sina) 14 sin? a 62065 
Ro 3 9 : 


where Rg is the range when resistance is absent. Figure 4.8 compares two different approxima- 


tions to R with the ‘exact’ value obtained by numerical solution of equation (4.4). As would be 


expected, the three term approximation is closer to the exact value. 


CIRCULAR MOTION 


In this section we examine some important problems in which a body moves on a 


circular path. Our first problem is concerned with a body executing a circular orbit under the 
gravitational attraction of a fixed mass. This is a fairly accurate model of the motion of the 
planets* around the Sun. 


Example 4.8 Circular orbit in the inverse square field 


A particle of mass m moves under the gravitational attraction of a fixed mass M 
situated at the origin. Show that circular orbits with centre O and any radius are 


* The orbits of Mercury, Mars and Pluto are the most elliptical with eccentricities of 0.206, 0.093 and 0.249 
respectively. The eccentricity of Earth’s orbit is 0.017. 
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possible, and find the speed of the particle in such an orbit. Deduce the period of the 
orbit. 


Solution 

Note that we are not required to find the general orbit; we may assume from the start 
that the orbit is a circle. Suppose then that the particle is executing a circular orbit 
with centre O and radius R. We need to confirm that the vector equation of motion 
can be satisfied. Take polar coordinates r, 9 with centre at O. Then the acceleration a 
of the particle is given in terms of the usual polar unit vectors by the formula (2.14), 


that is, 
a= (i - re )F+ (r6 + 276) 0 
a+ 50 
=—-—T+0 
R 
for motion on the circle r = R, where the circumferential velocity v = RO. The 


equation of motion for the particle is therefore 
v2 aa ~ mM G Jas 
m|—-—Fr+00|=— r, 

R R 


which, on taking components in the radial and transverse directions, gives 


v° MG 


eee and v=0. 


Hence the equation of motion is satisfied if v is a constant given by 


MG 
P= —, 
R 
Thus a circular orbit of radius R is possible provided that the particle has constant 
speed (VG/R)!/2. 
The period t of the orbit is the time taken for one circuit and is given by 


2nR eas 
C= = : 


v MG 


Thus the square of the period of a circular orbit is proportional to the cube of its 
radius. This is a special case of Kepler’s third law of planetary motion (see Chapter 
7). 


A particle may move on a circular path because it is constrained to do so. The simplest 
and most important example of this is the simple pendulum, a mass suspended from a fixed 
point by a string. 


Example 4.9 The simple pendulum 
A particle P is suspended from a fixed point O by a light inextensible string of length 


b. P is subject to uniform gravity and moves in a vertical plane through O with the 
string taut. Find the equation of motion. 
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FIGURE 4.9 The simple pendulum 
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™ 


mg 
Solution 
The system is shown in Figure 4.9. Since the string is of fixed length 5, the position 


of P is determined by the angle @ shown. The acceleration of P can be expressed in 
the polar form 


a = — (b6?)F + (b6) 6, 
(6°) F-+ (66) 


where F and 0 are the polar unit vectors shown in Figure 4.9. 

P moves under the uniform gravity force —mgk and the tension T in the string 
which acts in the direction —7. It should be noted that the tension T is a force of 
constraint and not known beforehand. The equation of motion is therefore 


m E (06?) F + (06) | = —mgk —TT. 


If we now take components of this equation in the radial and transverse directions we 
obtain 


—mbé* = mg cos6 —T, mbé = —mg sing. 


The second of these equations is the effective equation of motion in terms of the 
‘coordinate’ 6, namely, 


a4 (5) sind = 0, (4.6) 


while the first equation determines the unknown tension T once @(t) is known. 

Equation (4.6) is the exact equation of motion for the simple pendulum. 
Because of the presence of the term in sin@, this second order ODE is non-linear 
and cannot be solved by using the standard technique for linear ODEs with constant 
coefficients. ll 


Question The linear theory for small amplitude oscillations 


Find an approximate linear equation for the case in which the pendulum undergoes 
oscillations of small amplitude. 
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Answer 

If 6 is always small then sin@ can be approximated by @ in which case the equation 
of motion becomes 

& 
b 
This is the linearised equation for the simple pendulum, which holds approximately 
for oscillations of small amplitude. Although we do not cover linear oscillations 
until Chapter 5, many readers will recognise equation (4.7) as the simple harmonic 
motion equation and will know that the period t of the oscillations is given by tT = 
2x (b/g)'/?, independent of the (small) amplitude. m 


d+( )o=0. (4.7) 


Question Period of large oscillations 


Find the period of the pendulum when the (angular) amplitude of its oscillations is 
a, where a may not be small. 


Answer 


This requires that we integrate the exact equation of motion (4.6). We start with a familiar trick. 
If we write Q = 6, then 


dQ dQ dé dQ 
= x Q 
dt do dt do 


and the equation of motion becomes 


a= 


dQ ( g 
dos \b 
This is a separable first order ODE for Q which integrates to give 


) sina, 


40 = (2) cosd+C, 


b 
where C is the constant of integration. On applying the initial condition Q = 0 when 6 = a, we 
find that C = —(g/ b)'/? cosa and the integrated equation can be written 
A Neer (4.8) 
= cos COs @) , : 
dt b 


where we have now replaced Q by d6/dt. 

Since the pendulum comes to rest only when d@/dt = 0 (that is, when 0 = +a) it follows 
that @ must oscillate in the range —a < 6 < a. The period T is the time taken for one complete 
oscillation but, by the symmetry of equation (4.8) under the transformation 6 + —8, it follows 
that the time taken for the pendulum to swing from 0 = 0 to 6 = +a is t/4. To evaluate this time 
we take the positive square root of each side of equation (4.8) and integrate over the time interval 
0<t< 1/4. This gives 


a dé 2 1/2 t/4 
= At, 
[ (cos 6 — cosa)!/2 ( b ) [ : 


apt? fe do 
= ge TaneD-—.caewylle: 4.9 
is ( & ) [ (cos 6 — cosa)! /2 (4.9) 


so that 
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This is the exact period of the pendulum when the amplitude of its oscillations is a. It is not 
possible to perform this integration in terms of standard functions* and so the integral must either 
be evaluated numerically or be approximated. 

Numerical evaluation shows that the exact period is longer than that predicted by the lin- 
earised theory. When a = 1/6, the period is 1.7% longer, and when aw = z/3 it is 7.3% longer. 
The period can also be approximated by expanding the integral in equation (4.9) as a power series 
in a. This is the subject of Problem 4.35 at the end of the chapter. The answer is that, expanded 


to two terms, 
p\ 1/2 ae 
t=20r(— Le Ole ily (4.10) 
g 16 


This two term approximation predicts an increase in the period of 1.7% when a = 2/6. Note 
that there is no term in this expansion proportional to a and that the term in a” has the small 
coefficient 1/16. This explains why the prediction of the linearised theory is rather accurate even 


when @ is not so small! 


In our final example, we solve the important problem of an electrically charged particle 


moving in a uniform magnetic field. It turns out that plane motions are circular, but the most 
general motion is helical. The solution in this case differs from the previous examples in that 
we use Cartesian coordinates instead of polars. This is because we do not know beforehand 
where the centre of the circle (or the axis of the helix) is, which means that we do not know on 
which point (or axis) to centre the polar coordinates. 


Example 4.10 Charged particle in a magnetic field 


A particle of mass m and charge e moves in a uniform magnetic field of strength Bo. 
Show that the most general motion is helical with the axis of the helix parallel to the 
direction of the magnetic field. 


Solution 


The total force F that an electric field E and magnetic field B exert on a charge e is 
given by the Lorentz force formula’ 


F=cE+evxB, 


where v is the velocity of the charge. In our problem, there is no electric field and 
the magnetic field is uniform. If the direction of B is the z-direction of Cartesian 
coordinates, then B = Bok. The equation of motion of the particle is then 


dv 
m— = eBovxk. 
dt 
If we now write v in the component form v = vyi + vy j + uzk, the vector equation 
of motion resolves into the three scalar equations 
dvx dvy dvz 


6), 2 testo, 726, 4.11 
dt vy dt m dt wa) 


* The integral is related to a special function called the complete elliptic integral of the first kind. 
* This form is correct in SI units. 
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where 
Q = eBo/m. (4.12) 


The last of these equations shows that v, = V, a constant, so that the component of 
v parallel to the magnetic field is a constant. The first two equations are first order 
coupled ODEs but they are easy to uncouple. If we differentiate the first equation 
with respect to ¢ and use the second equation, we find that v, satisfies the equation 


dv, 
dt? 


+ 27 vy = 0: 


This equation is a second order ODE with constant coefficients and can be solved in 
the standard way. However, many readers will recognise this as the SHM equation 
whose general solution can be written in the form 


v, = Asin(Qt+a), 


where A and «@ are arbitrary constants. It is more convenient if we introduce a new 
arbitrary constant R defined by A = —QR, so that 


vy = —QR sin(Qt + a). 
If we now substitute this formula for v, into the first equation of (4.11), we obtain 
vy = —QR cos(Qt + a). 


Having obtained the solution for the three components of v, we can now find the 
trajectory simply by integrating with respect to ¢. This gives 


x=Reos(Qt+a)+a, y=—Rsin(Qt+a)+b, z=Vt+c, 


where a, b, and c are constants of integration. These constants may be removed by 
a shift of the origin of coordinates to the point (a, b,c), and the constant a may be 
removed by a shift in the origin of f. Also, the constant R may be assumed positive; 
if it is not, make a shift in the origin of ¢ by 2/2. With these simplifications, the final 
form for the trajectory is 


x = Reos Qt, y=—-RsnQ, Z=Vt, (4.13) 


where R is a positive constant and Q = eBo/m. This is the most general trajectory 
for a charged particle moving in a uniform magnetic field. 

To identify this trajectory as a helix, suppose first that V = 0 so that the motion 
takes place in the (x, y)-plane. Then the first two equations of (4.13) imply that 
the path is a circle of radius R traversed with constant speed R|Q| and with period 
27 /|Q|. When V + 0, this circular motion is supplemented by a uniform velocity V 
in the z-direction. The result is a helical path of radius R, with its axis parallel to the 
magnetic field, which is traversed with constant speed (V7 + R?Q7)!/?. m 


The above problem has important applications to the cyclotron particle accelerator and 
the mass spectrograph. The cyclotron depends for its operation on thefact that the constant 
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Q, known as the cyclotron frequency, is independent of the velocity of the charged particles. 
The mass spectrograph is the subject of Problem 4.32 at the end of the chapter. 


Problems on Chapter 4 


Answers and comments are at the end of the book. 


Harder problems carry a star (*). 


Introductory problems 


4.1 Two identical blocks each of mass M are connected by a light inextensible string and can 
move on the surface of a rough horizontal table. The blocks are being towed at constant speed 
in a straight line by a rope attached to one of them. The tension in the tow rope is To. What is 
the tension in the connecting string? The tension in the tow rope is suddenly increased to 47. 
What is the instantaneous acceleration of the blocks and what is the instantaneous tension in 
the connecting string? 


4.2 A body of mass M is suspended from a fixed point O by an inextensible uniform rope 
of mass m and length b. Find the tension in the rope at a distance z below O. The point of 
support now begins to rise with acceleration 2g. What now is the tension in the rope? 


4.3 Two uniform lead spheres each have mass 5000 kg and radius 47 cm. They are released 
from rest with their centres | m apart and move under their mutual gravitation. Show that they 
will collide in /ess than 425 s. [G = 6.67 x 107!'! Nm? kg~?.] 


4.4 The block in Figure 4.2 is sliding down the inclined surface of a fixed wedge. This time 
the frictional force F' exerted on the block is given by F = wN, where N is the normal reaction 
and ju is a positive constant. Find the acceleration of the block. How do the cases w < tana 
and jz > tana differ? 


4.5 A stuntwoman is to be fired from a cannon and projected a distance of 40 m over level 
ground. What is the least projection speed that can be used? If the barrel of the cannon is 
5m long, show that she will experience an acceleration of at least 4g in the barrel. [Take 
g=10ms~?.] 


4.6 Inan air show, a pilot is to execute a circular loop at the speed of sound (340 ms~!). The 
pilot may black out if his acceleration exceeds 8g. Find the radius of the smallest circle he can 
use. [Take g = 10 ms~?.] 


4.7 A body has terminal speed V when falling in still air. What is its terminal velocity (relative 
to the ground) when falling in a steady horizontal wind with speed U? 


4.8 Cathode ray tube A particle of mass m and charge e is moving along the x-axis with 
speed u when it passes between two charged parallel plates. The plates generate a uniform 
electric field Eo j in the region 0 < x < b and no field elsewhere.* Find the angle through 


* This is only approximately true. 
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which the particle is deflected by its passage between the plates. [The cathode ray tube uses 
this arrangement to deflect the electron beam. ] 


Straight line motion in a force field 


4.9 An object is dropped from the top of a building and is in view for time t while passing a 
window of height / some distance lower down. How high is the top of the building above the 
top of the window? 


4.10 A particle P of mass m moves under the gravitational attraction of a mass M fixed at 
the origin O. Initially P is at a distance a from O when it is projected with the critical escape 
speed (2MG/a)!/? directly away from O. Find the distance of P from O at time t, and 
confirm that P escapes to infinity. 


4.11 A particle P of mass m is attracted towards a fixed origin O by a force of magnitude 
my /r?, where r is the distance of P from O and y is a positive constant. [It’s gravity Jim, 
but not as we know it.] Initially, P is at a distance a from O, and is projected with speed u 
directly away from O. Show that P will escape to infinity if u? > y/a?. 

For the case in which u* = y/(2a7), show that the maximum distance from O achieved 
by P in the subsequent motion is /2a, and find the time taken to reach this distance. 


4.12 If the Earth were suddenly stopped in its orbit, how long would it take for it to collide 
with the Sun? [Regard the Sun as a fixed point mass. You may make use of the formula for the 
period of the Earth’s orbit.] 


Constrained motion 


4.13 A particle P of mass m slides on a smooth horizontal table. P is connected to a second 
particle Q of mass M by a light inextensible string which passes through a small smooth hole 
O in the table, so that Q hangs below the table while P moves on top. Investigate motions 
of this system in which Q remains at rest vertically below O, while P describes a circle with 
centre O and radius b. Show that this is possible provided that P moves with constant speed 
u, where u2 = Mgb/m. 


4.14 A light pulley can rotate freely about its axis of symmetry which is fixed in a horizontal 
position. A light inextensible string passes over the pulley. At one end the string carries a mass 
4m, while the other end supports a second light pulley. A second string passes over this pulley 
and carries masses m and 4m at its ends. The whole system undergoes planar motion with the 
masses moving vertically. Find the acceleration of each of the masses. 


4.15 A particle P of mass m can slide along a smooth rigid straight wire. The wire has one of 
its points fixed at the origin O, and is made to rotate in the (x, y)-plane with angular speed Q. 
By using the vector equation of motion of P in polar co-ordinates, show that 7, the distance of 
P from O, satisfies the equation 


#-Q?r =0, 
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and find a second equation involving NV, where N@ is the force the wire exerts on P. [Ignore 
gravity in this question. ] 

Initially, P is at rest (relative to the wire) at a distance a from O. Find r as a function of ¢ 
in the subsequent motion, and deduce the corresponding formula for NV. 


Resisted motion 


4.16 A body of mass m is projected with speed uv in a medium that exerts a resistance force 
of magnitude (i) mk|v|, or (ii) mK |v|?, where k and K are positive constants and v is the 
velocity of the body. Gravity can be ignored. Determine the subsequent motion in each case. 
Verify that the motion is bounded in case (i), but not in case (ii). 


4.17 A body is projected vertically upwards with speed u and moves under uniform gravity 
in a medium that exerts a resistance force proportional to the square of its speed and in which 
the body’s terminal speed is V. Find the maximum height above the starting point attained by 
the body and the time taken to reach that height. 

Show also that the speed of the body when it returns to its starting point is wV/(V?7 + 
u2)!/2. [Hint. The equations of motion for ascent and descent are different. See the note at the 
end of section 4.3.] 


4.18% A body is released from rest and moves under uniform gravity in a medium that exerts 
a resistance force proportional to the square of its speed and in which the body’s terminal speed 
is V. Show that the time taken for the body to fall a distance h is 


uo cosh~! (es¥/¥?) : 
& 


In his famous (but probably apocryphal) experiment, Galileo dropped different objects 
from the top of the tower of Pisa and timed how long they took to reach the ground. If Galileo 
had dropped two iron balls, of 5 mm and 5 cm radius respectively, from a height of 25 m, what 
would the descent times have been? Is it likely that this difference could have been detected? 
[Use the quadratic law of resistance with C = 0.8. The density of iron is 7500 kg m7~>.] 


4.19 A body is projected vertically upwards with speed u and moves under uniform gravity in 
a medium that exerts a resistance force proportional to the fourth power its speed and in which 
the body’s terminal speed is V. Find the maximum height above the starting point attained by 
the body. 

Deduce that, however large u may be, this maximum height is always less than 1V7/4g. 


4.20 Millikan’s experiment A microscopic spherical oil droplet, of density and unknown 
radius, carries an unknown electric charge. The droplet is observed to have terminal speed v; 
when falling vertically in air of viscosity 4. When a uniform electric field Eo is applied in the 
vertically upwards direction, the same droplet was observed to move upwards with terminal 
speed v2. Find the charge on the droplet. [Use the low Reynolds number approximation for 
the drag.] 
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Projectiles 


4.21 A mortar gun, with a maximum range of 40 m on level ground, is placed on the edge of 
a vertical cliff of height 20 m overlooking a horizontal plain. Show that the horizontal range 
R of the mortar gun is given by 


1 
R = 40 sina ++ (1 + sina) } cosa 


where a is the angle of elevation of the mortar above the horizontal. [Take g = 10 ms~?.] 

Evaluate R (to the nearest metre) when a = 45° and 35° and confirm that a = 45° does 
not yield the maximum range. [Do not try to find the optimum projection angle this way. See 
Problem 4.22 below. ] 


4.22 It is required to project a body from a point on level ground in such a way as to clear a 
thin vertical barrier of height / placed at distance a from the point of projection. Show that 
the body will just skim the top of the barrier if 


2 2 
($5) tan? @w — a tana + (ss + n) = 0, 


where u is the speed of projection and aq is the angle of projection above the horizontal. 
Deduce that, if the above trajectory is to exist for some a, then u must satisfy 


i= 2ehu = ga’ > 0. 


Find the least value of u that satisfies this inequality. 
For the special case in which a = \/3h, show that the minimum projection speed neces- 
sary to clear the barrier is (3gh) 2, and find the projection angle that must be used. 


4.23 A particle is projected from the origin with speed u in a direction making an angle a 
with the horizontal. The motion takes place in the (x, z)-plane, where Oz points vertically 
upwards. If the projection speed u is fixed, show that the particle can be made to pass through 
the point (a, b) for some choice of a if (a, b) lies below the parabola 


This is called the parabola of safety. Points above the parabola are ‘safe’ from the projectile. 

An artillery shell explodes on the ground throwing shrapnel in all directions with speeds 
of up to 30 ms~!. A man is standing at an open window 20 m above the ground in a building 
60 m from the blast. Is he safe? [Take g = 10 ms~?.] 


4.24 A projectile is fired from the top of a conical mound of height / and base radius a. What 
is the least projection speed that will allow the projectile to clear the mound? [Hint. Make use 
of the parabola of safety.] 

A mortar gun is placed on the summit of a conical hill of height 60 m and base diameter 
160 m. If the gun has a muzzle speed of 25 ms~!, can it shell anywhere on the hill? [Take 
g=10ms~7.] 
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4.25 An artillery gun is located on a plane surface inclined at an angle 6 to the horizontal. 
The gun is aligned with the line of steepest slope of the plane. The gun fires a shell with speed 
u in the direction making an angle a with the (upward) line of steepest slope. Find where the 
shell lands. 

Deduce the maximum ranges RU, R”, up and down the plane, and show that 


RY _ 1l-sinB 
RP 1+sinB’ 


4.26 Show that, when a particle is projected from the origin in a medium that exerts linear 
resistance, its position vector at time ¢ has the general form 


r=—a(t)k + B(t)u, 


where k is the vertically upwards unit vector and u is the velocity of projection. Deduce the 
following results: 


(i) A number of particles are projected simultaneously from the same point, with the 
same speed, but in different directions. Show that, at each later time, the particles all 
lie on the surface of a sphere. 

(ii) A number of particles are projected simultaneously from the same point, in the same 
direction, but with different speeds. Show that, at each later time, the particles all lie 
on a straight line. 

(iii) Three particles are projected simultaneously in a completely general manner. Show 
that the plane containing the three particles remains parallel to some fixed plane. 


4.27 A body is projected in a steady horizontal wind and moves under uniform gravity and 
linear air resistance. Show that the influence of the wind is the same as if the magnitude and 
direction of gravity were altered. Deduce that it is possible for the body to return to its starting 
point. What is the shape of the path in this case? 


Circular motion and charged particles 


4.28 The radius of the Moon’s approximately circular orbit is 384,000 km and its period is 
27.3 days. Estimate the mass of the Earth. [G = 6.67 x 10~!! Nm? kg~?.] The actual mass 
is 5.97 x 10°4 kg. What is the main reason for the error in your estimate? 

An artificial satellite is to be placed in a circular orbit around the Earth so as to be “geo- 
stationary’. What must the radius of its orbit be? [The period of the Earth’s rotation is 23 h 56 
m, not 24h. Why?] 


4.29 Conical pendulum A particle is suspended from a fixed point by a light inextensible 
string of length a. Investigate ‘conical motions’ of this pendulum in which the string maintains 
a constant angle a with the downward vertical. Show that, for any acute angle a, a conical 
motion exists and that the particle speed wu is given by u? = ag sina tana. 


4.30 A particle of mass m is attached to the highest point of a smooth rigid sphere of radius 
a by a light inextensible string of length za/4. The particle moves in contact with the outer 
surface of the sphere, with the string taut, and describes a horizontal circle with constant 
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speed u. Find the reaction of the sphere on the particle and the tension in the string. Deduce 
the maximum value of u for which such a motion could take place. What will happen if u 
exceeds this value? 


4.31 A particle of mass m can move on a rough horizontal table and is attached to a fixed 
point on the table by a light inextensible string of length b. The resistance force exerted on 
the particle is —mKv, where v is the velocity of the particle. Initially the string is taut and 
the particle is projected horizontally, at right angles to the string, with speed u. Find the angle 
turned through by the string before the particle comes to rest. Find also the tension in the string 
at time ¢. 


4.32 Mass spectrograph A stream of particles of various masses, all carrying the same 
charge e, is moving along the x-axis in the positive x-direction. When the particles reach the 
origin they encounter an electronic ‘gate’ which allows only those particles with a specified 
speed V to pass. These particles then move in a uniform magnetic field Bo acting in the z- 
direction. Show that each particle will execute a semicircle before meeting the y-axis at a 
point which depends upon its mass. [This provides a method for determining the masses of 
the particles.] 


4.33 The magnetron An electron of mass m and charge —e is moving under the combined 
influence of a uniform electric field Eg j and a uniform magnetic field Bok. Initially the 
electron is at the origin and is moving with velocity ui. Show that the trajectory of the electron 
is given by 


x =a(Qt) + bsin Qt, y = b(1 — cos 2), z=0, 


where Q = eBo/m,a = Eo/QBo and b = (uBo— Eo) / QBo. Use computer assistance to plot 
typical paths of the electron for the cases a < b,a = banda > b. [The general path is called a 
trochoid, which becomes a cycloid in the special case a = b. Cycloidal motion of electrons is 
used in the magnetron vacuum tube, which generates the microwaves in a microwave oven.] 


Computer assisted problems 


4.34 Complete Example 4.7 on the projectile with linear resistance by obtaining the quoted 
asymptotic formula for the range of the projectile. 


4.35 Find a series approximation for the period of the simple pendulum, in powers of the 
angular amplitude a. Proceed as follows: 
The exact period t of the pendulum was found in Example 4.9 and is given by the integral (4.9). This 


integral is not suitable for expansion as it stands. However, if we write cos@ — cosa = 2(sin? (a /2) - 
sin? (0/2)) and make the sneaky substitution sin(@/2) = sin(a@/2) sin ¢, the formula for tT becomes 


1/2 px/2 -1/2 
r=4(?) i (1 — ¢?sin* 4) : do 
§ 0 


where € = sin(a@/2). This new integrand is easy to expand as a power series in the variable € and the limits 
of integration are now constants. Use computer assistance to expand the integrand to the required number 


of terms and then integrate term by term over the interval [0, 7/2]. Finally re-expand as a power series 
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in the variable w. The answer to two terms is given by equation (4.10), but it is just as easy to obtain any 


number of terms. 


4.36 Baseball trajectory A baseball is struck with an initial speed of 45 ms~! (just over 100 
mph) at an elevation angle of 40°. Find its path and compare this with the corresponding path 
when air resistance is neglected. [A baseball has mass 0.30 kg and radius 3.5 cm. Assume the 
quadratic law of resistance. ] 


Show that the equation of motion can be written in the form 


dv _ pg 2 
a * v2}? 


where V is the terminal speed. Resolve this vector equation into two (coupled) scalar equations for v, and 


vz and perform a numerical solution. In this example, air resistance reduces the range by about 35%. It 


really is easier to hit a home run in Mile High stadium! 


Chapter Five 
SS S=]S>:cCoooeeeeoe 


Linear oscillations 


and normal modes 


KEY FEATURES 
The key features of this chapter are the properties of free undamped oscillations, free damped 
oscillations, driven oscillations, and coupled oscillations. 


Oscillations are a particularly important part of mechanics and indeed of physics as a 
whole. This is because of their widespread occurrence and the practical importance of oscil- 
lation problems. In this chapter we study the classical linear theory of oscillations, which 
is important for two reasons: (i) the linear theory usually gives a good approximation to the 
motion when the amplitude of the oscillations is small, and (11) in the linear theory, most prob- 
lems can be solved explicitly in closed form. The importance of this last fact should not be 
underestimated! We develop the theory in the context of the oscillations of a body attached to a 
spring, but the same equations apply to many different problems in mechanics and throughout 
physics. 

In the course of this chapter we will need to solve linear second order ODEs with constant 
coefficients. For a description of the standard method of solution see Boyce & DiPrima [8]. 


5.1 BODY ON A SPRING 


Suppose a body of mass m is attached to one end of a light spring. The other end of 
the spring is attached to a fixed point A on a smooth horizontal table, and the body slides on 


Equilibrium position AN) H m ! 
In motion Ha H m| —»v 


Forces R —_ 


FIGURE 5.1 The body m is attached to one end of a light spring and moves in a 
straight line. 
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the table in a straight line through A. Let x be the displacement and v the velocity of the body 
at time ¢, as shown in Figure 5.1; note that x is measured from the equilibrium position of the 
body. 

Consider now the forces acting on the body. When the spring is extended, it exerts a 
restoring force S in the opposite direction to the extension. Also, the body may encounter 
a resistance force R acting in the opposite direction to its velocity. Finally, there may be an 
external driving force G(r) that is a specified function of the time. The equation of motion for 
the body is then 

du 
m—=-—-S—R+G(t). (5.1) 
dt 

The restoring force S is determined by the design of the spring and the extension x. For 
sufficiently small strains,* the relationship between S and x is approximately linear, that is, 


S=ax, (5.2) 


where @ is a positive constant called the spring constant (or strength) of the spring. A 
powerful spring, such as those used in automobile suspensions, has a large value of a; the 
spring behind a doorbell has a small value of a. The formula (5.2) is called Hooke’s law’ and 
a spring that obeys Hooke’s law exactly is called a linear spring. 

The resistance force R depends on the physical process that is causing the resistance. 
For fluid resistance, the linear or quadratic resistance laws considered in Chapter 4 may be 
appropriate. However, neither of these laws represents the frictional force exerted by a rough 
table. In this chapter we assume the law of linear resistance 


R= Bu, (5.3) 


where £ is a positive constant called the resistance constant; it is a measure of the strength 
of the resistance. There is no point in disguising the fact that our major reason for assuming 
linear resistance is that (together with Hooke’s law) it leads to a linear equation of motion that 
can be solved explicitly. However, it does give insight into the general effect of all resistances, 
and actually is appropriate when the resistance arises from slow viscous flow (automobile 
shock absorbers, for instance); it is also appropriate in the electric circuit analogue, where it is 
equivalent to Ohm’s law. 
With Hooke’s law and linear resistance, the equation of motion (5.1) for the body 
becomes 
dx dx 


— x = G(t), 
hg eg ee (t) 


* The strain is the extension of the spring divided by its natural length. If the strain is large, then the linear 
approximation will break down and a non-linear approximation, such as S = ax + bx> must be used 
instead. 

+ After Robert Hooke (1635-1703). Hooke was an excellent scientist, full of ideas and a first class exper- 
imenter, but he lacked the mathematical skills to develop his ideas. When other scientists (Newton in 
particular) did so, he accused them of stealing his work and this led to a succession of bitter disputes. So 
that his rivals could not immediately make use of his discovery, Hooke first published the law that bears 
his name as an anagram on the Latin phrase ‘ut tensio, sic vis’ (as the extension, so the force). 
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where a is the spring constant, # is the resistance constant and G(f) is the prescribed driving 
force. This is a second order, linear ODE with constant coefficients for the unknown displace- 
ment x(t). We could go ahead with the solution of this equation as it stands, but the algebra is 
made much easier by introducing two new constants Q and K (instead of a and £) defined by 
the relations 


a =mQ B=2mK. 


The equation of motion for the body then becomes 


a’ son 4 ea Fy (5.4) 
dt? dt _— 


where F(t) = G(t)/m, the driving force per unit mass. This is the standard form of the 
equation of motion for the body. Any system that leads to an equation of this form is called 
a damped™ linear oscillator. When the force F(t) is absent, the oscillations are said to be 
free; when it is present, the oscillations are said to be driven. 


5.2 CLASSICAL SIMPLE HARMONIC MOTION 


A linear oscillator that is both undamped and undriven is called a classical linear 
oscillator. This is the simplest case, but arguably the most important system in physics! The 
equation (5.4) reduces to 


are 9 (5.5) 
which, because of the solutions we are about to obtain, is called the SHM equation. 


Solution procedure 


Seek solutions of the form x = e*’. Then A must satisfy the equation 
i? 4-0? =O, 


which gives A = +72. We have thus found the pair of complex solutions 


x= ga. 


which form a basis for the space of complex solutions. The real and imaginary parts of the 
first complex solution are 


cos Qt 
: — oe 
sin Qt 


* Damping is another term for resistance. Indeed, automobile shock absorbers are sometimes called 
dampers. 
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FIGURE 5.2 Classical simple harmonic motion 
x =Ccos(Qr — y). 


and these functions form a basis for the space of real solutions. The general real solution of 
the SHM equation is therefore 


x = AcosQt +B sin Qt, (5.6) 


where A and B are real arbitrary constants. This general solution can be written in the alterna- 
tive form* 


x =C cos(Qt — y), (5.7) 


where C and y are real arbitrary constants with C > 0. 


General form of the motion 


The general form of the motion is most easily deduced from the form (5.7) and is shown in 
Figure 5.2. This is called simple harmonic motion (SHM). The body makes infinitely many 
oscillations of constant amplitude C; the constant y is simply a ‘phase factor’ which shifts 
the whole graph by y/ in the ¢-direction. Since the cosine function repeats itself when the 
argument Q2r increases by 27, it follows that the period of the oscillations is given by 

21 

t=. 5.8 

© (5.8) 
The quantity Q, which is related to the frequency v by Q = 271, is called the angular 
frequency of the oscillations. 


Example 5.1 An initial value problem for classical SHM 


A body of mass m is suspended from a fixed point by a light spring and can move 
under uniform gravity. In equilibrium, the spring is found to be extended by a distance 
b. Find the period of vertical oscillations of the body about this equilibrium position. 
[Assume small strains. ] 


* This transformation is based on the result from trigonometry that a cos @ + b sin @ can always be written 
in the form c cos(@ — y), where c = (a2 + b*)!/2 and tany = b/a. 
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The body is hanging in its equilibrium position when it receives a sudden blow 
which projects it upwards with speed u. Find the subsequent motion. 


Solution 
When the spring is subjected to a constant force of magnitude mg, the extension is b. 
Hence a, the strength of the spring, is given by a = mg/b. 

Let z be the downwards displacement of the body from its equilibrium position. 
Then the extension of the spring is b + z and the restoring force is a(b + z) = 
g(b+ z)/b. The equation of motion for the body is therefore 


d*z mg(b + z) 
mg 


that is 


d*z g 
dt2 ae (5): = 0. 


This is the SHM equation with Q? = g¢/b. It follows that the period t of vertical 
oscillations about the equilibrium position is given by 


t= auld => Qn (2) 3 
Q g 


In the initial value problem, the subsequent motion must have the form 
x = AcosQt+ B sin Qt, 


where Q = (g/b)!/?. The initial condition x = 0 when t = 0 shows that A = 0 and 
the initial condition x = —u when t = 0 then gives QB = —u, that is, B = —u/Q. 
The subsequent motion is therefore 


u, 
x=- a=sinQ, 


where Q = (¢/b)!/?. m 


5.3 DAMPED SIMPLE HARMONIC MOTION 


When damping is present but there is no external force, the general equation (5.4) 
reduces to 
a son 4 ox =0 (5.9) 
—_ —_ X =U, . 
dt? dt 
the damped SHM equation. 
The solution procedure is the same as in the last section. Seek solutions of the form 
x =e”, Then A must satisfy the equation 


24+2KAi4+ 97 =0, 
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that is 
O+kKyY=K?= 0°, 


We see that different cases arise depending on whether K < Q, K = Qor K > Q. These 
cases give rise to different kinds of solution and must be treated separately. 


Under-damping (sub-critical damping): K < Q 
In this case, we write the equation for A in the form 
(A+K)* =-Q%, 


where Qp = (22 _ Key a positive real number. The values are then A = —K +iQp. 
We have thus found the pair of complex solutions 


es oe Kt etiQpt 


’ 


which form a basis for the space of complex solutions. The real and imaginary parts of the 
first complex solution are 


and these functions form a basis for the space of real solutions. The general real solution of 
the damped SHM equation in this case is therefore 


x =e *' (AcosQpt + BsinQpt), (5.10) 


where A and B are real arbitrary constants. This general solution can be written in the alterna- 
tive form 


x = Ce *' cos(Qpt — y), (5.11) 
where C and y are real arbitrary constants with C > 0. 


General form of the motion 


The general form of the motion is most easily deduced from the form (5.11) and is shown 
in Figure 5.3. This is called under-damped SHM. The body still executes infinitely many 
oscillations, but now they have exponentially decaying amplitude Ce~*'. Suppose the period 
t of the oscillations is defined as shown in Figure 5.3.* The introduction of damping decreases 
the angular frequency of the oscillations from Q to Qp, which increases the period of the 
oscillations from 277 / Q to 

20 20 


C= Qp = (Q2 — K2) 172" (5.12) 


* The period might also be defined as the time interval between successive maxima of the function x(t). 
Since these maxima do not occur at the points at which x(t) touches the bounding curves, it is not obvious 
that this time interval is even a constant. However, it is a constant and has the same value as (5.12) (see 
Problem 5.5) 
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FIGURE 5.3 Under-damped simple harmonic motion 
x =Ce-*' cos(Qpt — y). 


Over-damping (super-critical damping): K > Q 
In this case, we write the equation for A in the form 
(A+ K)? = 6’, 


where 6 = (K?2 — ?)!/2, a positive real number. The i values are then A = —k + 6, which 
are now real. We have thus found the pair of real solutions 


x= e Ktytst 


which form a basis for the space of real solutions. The general real solution of the damped 
SHM equation in this case is therefore 


x=e*t (Ae*! + Be") ; (5.13) 


where A and B are real arbitrary constants. 


General form of the motion 


Three typical forms for the motion are shown in Figure 5.4. This is called over-damped SHM. 
Somewhat surprisingly, the body does not oscillate at all. For example, if the body is released 
from rest, then it simply drifts back towards the equilibrium position. On the other hand, if 
the body is projected towards the equilibrium position with sufficient speed, then it passes the 
equilibrium position once and then drifts back towards it from the other side. 


Critical damping: K = Q 


The case of critical damping is solved in Problem 5.6. Qualitatively, the motions look like 
those in Figure 5.4. 
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FIGURE 5.4 Three typical cases of over-damped simple 
harmonic motion. 


5.4 DRIVEN (FORCED) MOTION 


We now include the effect of an external driving force G(¢) which we suppose to be 
a given function of the time. In the case of a body suspended by a spring, we could apply such 
a force directly, but, in practice, the external ‘force’ often arises indirectly by virtue of the 
suspension point being made to oscillate in some prescribed way. The seismograph described 
in the next section is an instance of this. 
Whatever the origin of the driving force, the governing equation for driven motion is 
(5.4), namely 


d2x dx 
— + 2K — 4 Ox = Fi), 5.14 
wo a (t) (5.14) 


where 2mK is the damping constant, m2? is the spring constant and mF (t) is the driving 
force. Since this equation is linear and inhomogeneous, its general solution is the sum of (i) 
the general solution of the corresponding homogeneous equation (5.9) (the complementary 
function) and (ii) any particular solution of the inhomogeneous equation (5.14) (the partic- 
ular integral). The complementary function has already been found in the last section, and 
it remains to find the particular integral for interesting choices of F(t). Actually there is a 
(rather complicated) formula for a particular integral of this equation for any choice of the 
driving force mF (t). However, the most important case by far is that of time harmonic forc- 
ing and, in this case, it is easier to find a particular integral directly. Time harmonic forcing is 
the case in which 


F(t) = Focos pt, (5.15) 


where Fo and p are positive constants; m Fo is the amplitude of the applied force and p is its 
angular frequency. 
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Solution procedure 


We first replace the forcing term Fo cos pt by its complex counterpart Foe'?". This gives the 
complex equation 


dx dx a iat 


We then seek a particular integral of this complex equation in the form 
x = cel? (5.17) 


where c is a complex constant called the complex amplitude. On substituting (5.17) into 
equation (5.16) we find that 


- ail (5.18) 
© 2 = p24 2iKp’ 
so that the complex function 
F ipt 
: (5.19) 


Q? — p? +2iKp 


is a particular integral of the complex equation (5.16). A particular integral of the real equa- 
tion (5.14) is then given by the real part of the complex expression (5.19). It follows that a 
particular integral of equation (5.14) is given by 


xP =a cos(pt — y), 


where a = |c| and y = — argc. This particular integral, which is also time harmonic with the 
same frequency as the applied force, is called the driven response of the oscillator to the force 
mFo cos pt; a is the amplitude of the driven response and y (0 < y < 7) is the phase angle 
by which the response lags behind the force. From the expression (5.18) for c, it follows that 


F 2K 
The general solution of equation (5.14) therefore has the form 
x =acos(pt —y) +x", (5.21) 
where x©" is the complementary function, that is, the general solution of the corresponding 


undriven problem. 

The undriven problem has already been solved in the last section. The solution took three 
different forms depending on whether the damping was supercritical, critical or subcritical. 
However, all these forms have one feature in common, that is, they all decay to zero with 
increasing time. For this reason, the complementary function for this equation is often called 
the transient response of the oscillator. Any solution of equation (5.21) is therefore the sum 
of the driven response x? (which persists) and a transient response x©" (which dies away). 
Thus, no matter what the initial conditions, after a sufficently long time we are left with just 
the driven response. In many problems, the transient response can be disregarded, but it must 
be included if inital conditions are to be satisfied. 
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Example 5.2 An initial value problem for driven motion 


The equation of motion of a certain driven damped oscillator is 


ax dx 
ae a = 10cost 


and initially the particle is at rest at the origin. Find the subsequent motion. 


Solution 


First we find the driven response x?. The complex counterpart of the equation of 
motion is 


d?x dx 
—— + 3— 42x = 10e" 
dt? 7 dt ae ° 


and we seek a solution of this equation of the form x = ce!’. On substituting in, we 
find that 


10 
c= 
1+3i 


= 1—-3i. 
It follows that the driven response x” is given by 
x? =m [(1 ie] = cost +3 sins, 


Now for the complementary function x°". This is the general solution of the 
corresponding undriven equation 


d?x + at 42 0 
ead aos x = 0, 
dt? dt 


which is easily found to be 
x = Ae + Be~*!, 


where A and B are arbitrary constants. The general solution of the equation of 
motion is therefore 


x =cost+3sint + Ae’ + Be". 


It now remains to choose A and B so that the initial conditions are satisfied. The 
condition x = 0 when ¢ = 0 implies that 


0=1+A+8B, 
and the condition x = 0 when t = 0 implies that 


0=3-—A-—2B. 
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>t 
FIGURE 5.5 The solid curve is the actual 
response and the dashed curve the driven 
response only. 
Solving these simultaneous equations gives A = —5 and B = 4. The subsequent 


motion of the oscillator is therefore given by 


x = cost+3sint —Se' + 4e~7". 
ee 
driven response __ transient response 
This solution is shown in Figure 5.5 together with the driven response only. In this 
case, the transient response is insignificant after less than one cycle of the driving 


force. The amplitude of the driven reponse is (17 + 37)!/* = \/10 and the phase lag 
is tan! (3/1) © 72°. 


Resonance of an oscillating system 
Consider the general formula 
Fo 
((Q2 _ p?)2 4 4K? p?) 


1/2 


for the amplitude a of the driven response to the force mF cos pt (see equation (5.20)). Sup- 
pose that the amplitude of the applied force, the spring constant, and the resistance constant 
are held fixed and that the angular frequency p of the applied force is varied. Then a is a 
function of p only. Which value of p produces the largest driven response? Let 


f@ = (2? — 4) +4K7¢. 


Then, since a = Fo/,/ f (p2), we need only find the minimum point of the function f (q) lying 
ing > 0. Now 


f'(q) = -2(2? —q) + 4K? =2 (4 AS 2k?)) 


so that f(q) decreases for g < Q? — 2K? and increases for g > Q? — 2K?. Hence f(g) has 
a unique minimum point at ¢g = Q? — 2K. Two cases arise depending on whether this value 
is positive or not. 


Case 1. When Q? > 2K?, the minimum point g = Q? — 2K? is positive and a has its 
maximum value when p = p*, where 


a = (Q? _ 2K?) 


116 Chapter 5 Linear oscillations 


L > Q 
' p/ 


FIGURE 5.6 The dimensionless amplitude (Fo/ Q”)a 
against the dimensionless driving frequency p/Q for 
(from the top) K /Q = 0.1, 0.2, 0.3, 1. 


The angular frequency p* is called the resonant frequency of the oscillator. The 
value of a at the resonant frequency is 


Fo 
2K (22 — K2)1/2" 


aAmax = 


Case 2. When Q? < 2K”, a is a decreasing funcion of p for p > 0 so that a has no maximum 
point. 


These results are illustrated in Figure 5.6. They are an example of the general physical 
phenomenon known as resonance, which can be loosely stated as follows: 


The phenomenon of resonance 


Suppose that, in the absence of damping, a physical system can perform free oscillations 
with angular frequency (2. Then a driving force with angular frequency p will induce a 
large response in the system when p is close to Q, providing that the damping is not too 
large. 


This principle does not just apply to the mechanical systems we study here. It is a general 
physical principle that also applies, for example, to the oscillations of electric currents in 
circuits and to the quantum mechanical oscillations of atoms. 

Note that the resonant frequency p* is always less than Q, but is close to Q when K /Q 
is small. The height of the resonance peak, dmax, iS given approximately by 


i. (oy 
OE Ta 
in the limit in which K /Q is small; dmax therefore tends to infinity in this limit. In the same 


limit, the width of the resonance peak is directly proportional to K / Q and consequently tends 
to zero. 


5.4 Driven (forced) motion 117 


General periodic driving force 


The method we have developed for the time harmonic driving force can be extended to any 
periodic driving force mF (t). A function f(t) is said to be periodic with period Tt if the values 
taken by f in any interval of length t are then repeated in the next interval of length t. An 
example is the ‘square wave’ function shown in Figure 5.7. The solution method requires that 
F (t) be expanded as a Fourier series.* A textbook on mechanics is not the place to develop 
the theory of Fourier series. Instead we will simply quote the essential results and then give an 
example of how the method works. To keep the algebra as simple as possible, we will suppose 
that the driving force has period 27." 


Fourier’s Theorem 


Fourier’s theorem states that any function f(t) that is periodic with period 27 can be 
expanded as a Fourier series in the form 


Co 
f= 50 ls a, cosnt + by sinnt, (5.22) 


n=l 


where the Fourier coefficients {a,} and {b,} are given by the formulae 


dn = -| f (t) cosnt dt, Dp = =| f(@) sinnt dt. (5.23) 


5 ih 


What this means is that amy function f(t) with period 27 can be expressed as a sum of 
time harmonic terms, each of which has period 27. In order to find the driven response of 
the oscillator when the force mF (t) is applied, we first expand F(t) in a Fourier series. We 
then find the driven response that would be induced by each of the terms of this Fourier series 
applied separately, and then simply add these responses together. The method depends on the 
equation of motion being /inear. 


Example 5.3 Periodic non-harmonic driving force 


Find the driven response of the damped linear oscillator 


d?x dx 
—— £ 2K — Px = FG 
7 + Pp + O°x (t) 


for the case in which F(t) is periodic with period 27 and takes the values 


Fo (0<t <7), 
—Fo (1% <t < 2m), 


F(t) =| 


* After Jean Baptiste Joseph Fourier 1768-1830. The memoir in which he developed the theory of trigono- 
metric series ‘On the Propagation of Heat in Solid Bodies’ was submitted for the mathematics prize of 
the Paris Institute in 1811; the judges included such luminaries as Lagrange, Laplace and Legendre. They 
awarded Fourier the prize but griped about his lack of mathematical rigour. 

* The general case can be reduced to this one by a scaling of the unit of time. 
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FIGURE 5.7 The ‘square wave’ input function F(t) is periodic with 
period 27. Its value alternates between + Fo. 


in the interval 0 < t < 27. This function* is shown in Figure 5.7. 


Solution 


The first step is to find the Fourier series of the function F(t). From the formula 
(5.23), the coefficient a, is given by 


i 77 1 ¥° ae 
a, = -| F(t) cosnt dt = — (—Fo) cosnt dt + — / (+ Fo) cos nt dt 
om Pia aA) 


1 = 


=0 


since both integrals are zero for n > | and are equal and opposite when n = 0. In the 
same way, 


a i i 1 f* 
b, = - | F(t)sinntdt = ~| (—Fo) sinnt dt + al (+ Fo) sinnt dt 
WT Jen T J—nx wT JO 
2Fo [* 
=! sin nt dt, 
mz Jo 


since this time the two integrals are equal. Hence 


; 2Fo j—"] 2Fo ¢ = orn) 
: 5 = 
a 


n 0 IU n 


_ 2Fo (1—(-1)" 
B(EE2) 


Hence the Fourier series of the function F (t) is 


Ftj= > on (——) sinnt. 


n=1 


* This function is the mechanical equivalent of a ‘square wave input’ in electric circuit theory. 
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2n/p ~27/p 2Qn/p 


FIGURE 5.8 Driven response of a damped oscillator to the alternating constant force +m Fo with 
angular frequency p: Left Q/p = 1.5, K/p = 1. Right Q/p = 2.5, K/p = 0.1. The light 
graphs show the first term of the exapansion series. 


The next step is to find the driven response of the oscillator to the force 
m(by sinnt), that is, the particular integral of the equation 


d*x dx 2 : 
qa x = by, sinnt. (5.24) 
The complex counterpart of this equaton is 

ay 2 dx Q2 - b int 

Goa 


int 


for which the particular integral is ce’”’, where the complex amplitude c is given by 


bn 
Q?2 — n2 +21 K- 


C= 


The particular integral of the real equation (5.24) is then given by 


. b,ei™! ; (Q2 — n*) sinnt + 2Kncosnt 
aS —— se = é 
Q2 —n24+2i1Kn : (Q2 — n2)2 + 4K2n2 
Finally we add together these separate responses to find the driven response of 


the oscillator to the force mF (t). On inserting the value of the coefficient b,, this 
gives 


= 2Fo 5 1 —(-1)" (Q?2 — n*) sinnt + 2Kncosnt (5.25) 
a n (Q2 — n2)?2 + 4K?2n2 


n=1 


In order to deduce anything from this complicated formula, we must either sum 
the series numerically or approximate the formula in some way. When Q and K are 
both small compared to the forcing frequency p, the series (5.25) converges quite 
quickly and can be approximated (to within a few percent) by the first term. Even 
when Q/p = 1.5 and K/p = 1, this is still a reasonable approximation (see Figure 
5.8 (left)). However, for larger values of Q2/p, the higher harmonics in the Fourier 
expansion of F(t) that have frequencies close to Q produce large contributions (see 
Figure 5.8 (right)). In this case, the series (5.25) must be summed numerically. 
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Equilibrium position ae 


FIGURE 5.9 A simple seismograph for measuring vertical ground motion. 


5.5 A SIMPLE SEISMOGRAPH 


The seismograph is an instrument that measures the motion of the ground on which it 
stands. In real earthquakes, the ground motion will generally have both vertical and horizontal 
components, but, for simplicity, we describe here a device for measuring vertical motion only. 

Our simple seismograph (see Figure 5.9) consists of a mass which is suspended from a 
rigid support by a spring; the motion of the mass relative to the support is resisted by a damper. 
The support is attached to the ground so that the suspension point has the same motion as 
the ground below it. This motion sets the suspended mass moving and the resulting spring 
extension is measured as a function of the time. Can we deduce what the ground motion was? 

Suppose the ground (and therefore the support) has downward displacement X (t) at time 
t and that the extension x(t) of the spring is measured from its equilibrium length. Then the 
displacement of the mass is x + X, relative to an inertial frame. The equation of motion (5.9) 
is therefore modified to become 


d?(x +X) dx 5 
that is, 
dx dx 2 dx 
sz i+ 2K + Fx = -—z. 
eo ge di? 


This means that the motion of the body relative to the moving support is the same as if the 
support were fixed and the external driving force —m(d*X /dt*) were applied to the body. 

First consider the driven response of our seismograph to a train of harmonic waves with 
amplitude A and angular frequency p, that is, 


X = Acos pt. 
The equation of motion for the spring extension x is then 


d? d 
= + 2K — +x = Ap’ cos pt. 
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m ™m 
FIGURE 5.10 Two particles are connected between three springs 
and perform longitudinal oscillations. 


The complex amplitude of the driven motion is 


pea 
c= ; 
—p?+2iKp +92 


and the real driven motion is 
x =acos(pt—y), 


where 


A 
i= |—1 + 2i(K/p) + (Q/p)?| 


a=|c (5.26) 
Thus, providing that the spring and resistance constants are accurately known, the angular 
frequency p and amplitude A of the incident wave train can be deduced. 

In practice, things may not be so simple. In particular, the incident wave train may be a 
mixture of harmonic waves with different amplitudes and frequencies, and these are not easily 
disentangled. However, if K and Q are chosen so that K /p and Q2/p are small compared with 
unity (for all likely values of p), then c = —A and X = —x approximately. Thus, in this 
case, the record for x(t) is simply the negative of the ground motion X (t).* Since this result 
is independent of the incident frequency, it should also apply to complicated inputs such as a 
pulse of waves. 


5.6 COUPLED OSCILLATIONS AND NORMAL MODES 


Interesting new effects occur when two or more oscillators are coupled together. Fig- 
ure 5.10 shows a typical case in which two bodies are connected between three springs and 
the motion takes place in a straight line. We restrict ourselves here to the classical theory in 
which the restoring forces are linear and damping is absent. If the springs are non-linear, then 
the displacements of the particles must be small enough so that the linear approximation is 
adequate. 

Let x and y be the displacements of the two bodies from their respective equilibrium 
positions at time t; because two coordinates are needed to specify the configuration, the system 
is said to have two degrees of freedom. Then, at time ft, the extensions of the three springs are 
x, y — x and —y respectively. Suppose that the strengths of the three springs are a, 2a and 


* What is actually happening is that the mass is hardly moving at all (relative to an inertial frame). 
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4a respectively. Then the three restoring forces are ax, 2a(y — x), —4ay and the equations of 
motion for the two bodies are 


mx = —ax + 2a(y — x), 


my = —2a(y — x) — 4ay, 
which can be written in the form 


¥ + 3n?x — 2n?y = 0, (6.27) 
jj —2n?x + 6n?y = 0, , 


where the positive constant n is defined by n? = a/m. These are the governing equations 
for the motion. They are a pair of simultaneous second order homogeneous linear ODEs with 
constant coefficients. The equations are coupled in the sense that both unknown functions 
appear in each equation; thus neither equation can be solved on its own. 


The solution procedure: normal modes 


The solution procedure is simply an extension of the usual method for finding the comple- 
mentary function for a single homogeneous linear ODE with constant coefficients. However, 
rather than seek solutions in exponential form, it is simpler to seek solutions directly in the 
trigonometric form 


x = Acos(@t —y), 
(5.28) 
y = Bcos(at — y), 


where A, B, w and y are constants. A solution of the governing equations (5.27) that has 
the form (5.28) is called a normal mode of the oscillating system. In a normal mode, all the 
coordinates that specify the configuration of the system vary harmonically in time with the 
same frequency and the same phase; however, they generally have different amplitudes. On 
substituting the normal mode form (5.28) into the governing equations (5.27), we obtain 


~—w* A cos(wt — y)+ 3n? A cos(wt — y)- 2n?B cos(wt — y) = 0, 
—w* B cos(ot — y)- 2n A cos(wt — Vit 6n7B cos(wt — y) = 0, 


which simplifies to give 


(3n? —w*)A—2n?B =O, 


—2n*A + (6n* — w*)B = 0, ie 
a pair of simultaneous linear algebraic equations for the amplitudes A and B. Thus a normal 
mode will exist if we can find constants A, B and w so that the equations (5.29) are satisfied. 
Since the equations are homogeneous, they always have the trivial solution A = B = 0, 
whatever the value of w. However, the trivial solution corresponds to the equilibrium solution 
x = y = Oof the governing equations (5.27), which is not a motion at all. We therefore require 
the equations (5.29) to have a non-trivial solution for A, B. There is a simple condition that 
this should be so, namely that the determinant of the system of equations should be zero, that 
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is, 


3n2 —@? —2n? 
det a ae = 0. (5.30) 


On simplification, this gives the condition 
wo — 9n?a* + 14n* = 0, (5.31) 


a quadratic equation in the variable w. If this equation has real positive roots wr, ws, then, 
for each of these values, the linear equations (5.29) will have a non-trivial solution for the 
amplitudes A, B. In the present case, the equation (5.31) factorises and the roots are found to 
be 


wo = 2n?, ws =7n’. (5.32) 


Hence there are two normal modes with (angular) frequencies /2n and \/7n respectively. 
These frequencies are known as the normal frequencies of the oscillating system. 


Slow mode: In the slow mode we have w* = 2n? so that the linear equations (5.29) become 
n2A —2n2B = 0, 
—~2n*A + 4n?B = 0. 


These two equations are each equivalent to the single equation A = 2B. This is to be expected 
since, if the equations were linearly independent, then there would be no non-trivial solution 
for A and B. We have thus found a family of non-trivial solutions A = 26, B = 6, where 6 can 
take any (non-zero) value. Thus the amplitude of the normal mode is not uniquely determined; 
this happens because the governing ODEs are linear and homogeneous. The slow normal 
mode therefore has the form 


x = 25cos(/V2nt — vy), 


y = dcos(V2nt — y), (5.33) 


where the amplitude factor 5 and phase factor y can take any values. We see that, in the slow 
mode, the two bodies always move in the same direction with the body on the left having twice 
the amplitude as the body on the right. 


Fast mode: In the fast mode we have w* = 7n and, by following the same procedure, we 
find that the form of the fast normal mode is 


x= 8cos(V7nt — y); 


y —26 cos(V/7 nt —y), (5.34) 


where the amplitude factor 5 and phase factor y can take any values. We see that, in the fast 
mode, the two bodies always move in opposite directions with the body on the right having 
twice the amplitude as the body on the left. 


The general motion 


Since the governing equations (5.27) are linear and homogeneous, a sum of normal mode 
solutions is also a solution. Indeed, the general solution can be written as a sum of normal 
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FIGURE 5.11 The two particles are attached to a light stretched 
string and perform small transverse oscillations. The 
displacements are shown to be large for clarity. 


modes. Consider the expression 


x = 26; cos(/2 nt —-y)+ 42 cos(V/7 nt — 2), 


(5.35) 
y= 6, cos(V2nt — y1) — 282 cos(7 nt — y2). 
This is simply a sum of the first normal mode (with amplitude factor 6; and phase factor 7) 
and the second normal mode (with amplitude factor 52 and phase factor y2). Since it is possible 
to choose these four arbitrary constants so that x, y, x, y take any set of assigned values when 
t = 0, this must be the general solution of the governing equations (5.27). 


Question Periodicity of the general motion 
Is the general motion periodic? 


Answer 

The general motion is a sum of normal mode motions with periods T,, tz respectively. 
This sum will be periodic with period t if (and only if) t is an integer multiple of both 
T, and 79, that is, if tT; /t2 1s a rational number. (In this case, the periods are said to be 
commensurate.) This in turn requires that w;/q is a rational number. In the present 
case, @,/@2 = (2/7)'/ 2 which is irrational. The general motion is therefore not 
periodic in this case. 

We conclude by solving another typical normal mode problem. 


Example 5.4 Small transverse oscillations 


Two particles P and Q, of masses 2m and m, are secured to a light string that is 
stretched to tension Tg between two fixed supports, as shown in Figure 5.11. The 
particles undergo small transverse oscillations perpendicular to the equilibrium line 
of the string. Find the normal frequencies, the forms of the normal modes, and the 
general motion of this system. Is the general motion periodic? 
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Solution 


First we need to make some simplifying assumptions.* We will assume that the trans- 
verse displacements x, y of the two particles are small compared with a; the three 
sections of the string then make small angles with the equilibrium line. We will also 
neglect any change in the tensions of the three sections of string. 

The left section of string then has constant tension To. When the particle P is 
displaced, this tension force has the transverse component —7o sin@, which acts as 
a restoring force on P; since @ is small, this component is approximately —To.x /a. 
Similar remarks apply to the other sections of string. The equations of transverse 
motion for P and Q are therefore 

ami — To , To - 2) 

a a 
To(y —*x) _ Toy 
a 2a~ 


my = 

which can be written in the form 
2% + 2n?x —n’y =0, (5.36) 
2% —2n*x + 3n7y =0, (5.37) 


where the positive constant n is defined by n* = Tp/ma. 
These equations will have normal mode solutions of the form 


x = Acos(wt —y), 
y = Bcos(at — y), 
when the simultaneous linear equations 
(2n* — 2w*)A — n?B =(), 
(5.38) 
—2n7A + (3n? — 2w”)B = 0, 


have a non-trivial solution for the amplitudes A, B. The condition for this is 
on? = 20" =n? 
det ( wh. agit al = 0. (5.39) 
On simplification, this gives 
204 — 5n?w? + 2n* = 0, (5.40) 


a quadratic equation in the variable w* 


found to be 


. This equation factorises and the roots are 


wr = sn’, ws = 2n’. (5.41) 


Hence there are two normal modes with normal frequencies n //2 and /2n respec- 
tively. 


* These assumptions are consistent with the more complete treatment given in Chapter 15. 
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Slow mode: In the slow mode we have w” = n? /2 so that the linear equations (5.38) 


become 
n2A —n?B =0, 
—2n7A + 2n?B = 0. 


These two equations are each equivalent to the single equation A = B so that we have 
the family of non-trivial solutions A = 6, B = 6, where 6 can take any (non-zero) 
value. The slow normal mode therefore has the form 


x = dcos(nt//2—y), 


y = dcos(nt//2 — y), (5.42) 


where the amplitude factor 5 and phase factor y can take any values. We see that, in 
the slow mode, the two particles always have the same displacement. 


Fast mode: In the fast mode we have w* = 2n? and, by following the same proce- 
dure, we find that the form of the fast normal mode is 


x= dcos(V2nt — y), 


y = —25 cos(V/2nt — y), (5.43) 


where the amplitude factor 5 and phase factor y can take any values. We see that, 
in the fast mode, the two particles always move in opposite directions with Q having 
twice the amplitude of P. 

The general motion is now the sum of the first normal mode (with amplitude 
factor 6; and phase factor y;) and the second normal mode (with amplitude factor 5 
and phase factor y2). This gives 


x = 8, cos(nt/V2— 1) + 52. c0s(V2nt — y2), his 
y=} cos(nt //2 —y1) — 262 cos(./2nt — 2). , 


For this system t1/t2 = w2/@, = 2 so that the general motion is periodic with 
period t) = 2/27/n. @ 


Problems on Chapter 5 


Answers and comments are at the end of the book. 


Harder problems carry a star (*). 


Free linear oscillations 


5.1 A certain oscillator satisfies the equation 


¥+4x =0. 


Initially the particle is at the point x = /3 when it is projected towards the origin with speed 
2. Show that, in the subsequent motion, 


x = V73cos2t — sin2r. 
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Deduce the amplitude of the oscillations. How long does it take for the particle to first reach 
the origin? 


5.2 When a body is suspended from a fixed point by a certain linear spring, the angular 
frequency of its vertical oscillations is found to be 1. When a different linear spring is used, 
the oscillations have angular frequency (22. Find the angular frequency of vertical oscillations 
when the two springs are used together (i) in parallel, and (11) in series. Show that the first of 
these frequencies is at least twice the second. 


5.3 A particle of mass m moves along the x-axis and is acted upon by the restoring force 
—m(n? + k?)x and the resistance force —2mkx, where n, k are positive constants. If the 
particle is released from rest at x = a, show that, in the subsequent motion, 


a ; 
x = —e(ncosnt +k sinnt). 
n 
Find how far the particle travels before it next comes to rest. 


5.4 An overdamped harmonic oscillator satisfies the equation 
X¥ + 10x + 16x = 0. 


At time f = 0 the particle is projected from the point x = | towards the origin with speed u. 
Find x in the subsequent motion. 
Show that the particle will reach the origin at some later time f if 


u—2 61 


u—8 


How large must u be so that the particle will pass through the origin? 


5.5 A damped oscillator satisfies the equation 
#+2Ki+ 07x =0 


where K and Q are positive constants with K < Q (under-damping). At time tf = 0 the 
particle is released from rest at the point x = a. Show that the subsequent motion is given by 


—Kt Re os 
x = ae cos Qpt + —sinQpt }, 
QD 


where Qp = (Q? — K?)!/2, 

Find all the turning points of the function x(t) and show that the ratio of successive max- 
imum values of x is e~27*/20, 

A certain damped oscillator has mass 10 kg, period 5 s and successive maximum values 
of its displacement are in the ratio 3 : 1. Find the values of the spring and damping constants 
a and B. 


5.6 Critical damping Find the general solution of the damped SHM equation (5.9) for the 
special case of critical damping, that is, when K = &. Show that, if the particle is initially 
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released from rest at x = a, then the subsequent motion is given by 
x =ae™ (14+ 91). 


Sketch the graph of x against fr. 


5.7% Fastest decay The oscillations of a galvanometer satisfy the equation 
#+2Ki+ 07x =0. 


The galvanometer is released from rest with x = a and we wish to bring the reading perma- 
nently within the interval —ea < x < ea as quickly as possible, where € is a small positive 
constant. What value of K should be chosen? One possibility is to choose a sub-critical value 
of K such that the first minimum point of x(t) occurs when x = —ea. [Sketch the graph of 
x(t) in this case.] Show that this can be acheived by setting the value of K to be 


54 -1/2 
K=Q]1+ (5 ) 
In(1/e) : 


If K has this value, show that the time taken for x to reach its first minimum is approximately 
Q7! In(1/e) when € is small. 


5.8 A block of mass M is connected to a second block of mass m by a linear spring of natural 
length 8a. When the system is in equilibrium with the first block on the floor, and with the 
spring and second block vertically above it, the length of the spring is 7a. The upper block is 
then pressed down until the spring has half its natural length and is then resleased from rest. 
Show that the lower block will leave the floor if M < 2m. For the case in which M = 3m/2, 
find when the lower block leaves the floor. 


Driven linear oscillations 


5.9 A block of mass 2 kg is suspended from a fixed support by a spring of strength 
2000 Nm~!. The block is subject to the vertical driving force 36cos pt N. Given that the 
spring will yield if its extension exceeds 4 cm, find the range of frequencies that can safely be 
applied. 


5.10 A driven oscillator satisfies the equation 
¥ + 07x = Focos[Q(1 + €)t], 
where € is a positive constant. Show that the solution that satisfies the initial conditions x = 0 
and x = 0 when t = 0 is 
Fo ee | . 1 
=i i ze Qt sinQ(1 + ze). 
e(1 + 7€)Q 


Sketch the graph of this solution for the case in which € is small. 


5.11 Figure 5.12 shows a simple model of a car moving with constant speed c along a gently 
undulating road with profile h(x), where h’(x) is small. The car is represented by a chassis 
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FIGURE 5.12 The car moves along a gently undulating road. 


which keeps contact with the road, connected to an upper mass m by a spring and a damper. 
At time ¢ the upper mass has displacement y(t) above its equilibrium level. Show that, under 
suitable assumptions, y satisfies a differential equation of the form 


yt2Ky+ OQ? y =2Kch'(ct) + 27h (ct) 


where K and Q are positive constants. 
Suppose that the profile of the road surface is given by h(x) = hocos(px/c), where ho 
and p are positive constants. Find the amplitude a of the driven oscillations of the upper mass. 
The vehicle designer adjusts the damper so that K = Q. Show that 


< 2 | 

as =n", 
V3 

whatever the values of the consants Q and p. 


5.12 Solution by Fourier series A driven oscillator satisfies the equation 
¥4+2Ki+ 07x = F(0), 


where K and Q are positive constants. Find the driven response of the oscillator to the saw 
tooth’ input, that is, when F'(t) is given by 


F(t) = Fot (-a <t <7) 


and F(t) is periodic with period 27. [It is a good idea to sketch the graph of the function 
F(t).] 


Non-linear oscillations that are piecewise linear 


5.13 A particle of mass m is connected to a fixed point O on a smooth horizontal table by 
a linear elastic string of natural length 2a and strength mQ?. Initially the particle is released 
from rest at a point on the table whose distance from O is 3a. Find the period of the resulting 
oscillations. 


5.14 Coulomb friction The displacement x of a spring mounted mass under the action of 
Coulomb friction satisfies the equation 


—Fo x>0 


as 27; = 
sas Fy <0 
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where Q and Fo are positive constants. If |x| > Fo/Q? when x = 0, then the motion con- 
tinues; if |x| < Fo/Q? when x = 0, then the motion ceases. Initially the body is released 
from rest with x = 9F9/2Q7. Find where it finally comes to rest. How long was the body in 
motion? 


5.15 A partially damped oscillator satisfies the equation 
H+ 2% +N?x =0, 


where {2 is a positive constant and x is given by 


where K is a positive constant such that K < Q. Find the period of the oscillator and the ratio 
of successive maximum values of x. 


Normal modes 


5.16 A particle P of mass 3m is suspended fron a fixed point O by a light linear spring with 
strength a. A second particle Q of mass 2m is in turn suspended from P by a second spring of 
the same strength. The system moves in the vertical straight line through O. Find the normal 
frequencies and the form of the normal modes for this system. Write down the form of the 
general motion. 


5.17 Two particles P and Q, each of mass m, are secured at the points of trisection of a 
light string that is stretched to tension Ty between two fixed supports a distance 3a apart. The 
particles undergo small transverse oscillations perpendicular to the equlilibrium line of the 
string. Find the normal frequencies, the forms of the normal modes, and the general motion of 
this system. [Note that the forms of the modes could have been deduced from the symmetry 
of the system.] Is the general motion periodic? 


5.18 A particle P of mass 3m is suspended from a fixed point O by a light inextensible string 
of length a. A second particle Q of mass m is in turn suspended from P by a second string of 
length a. The system moves in a vertical plane through O. Show that the linearised equations 
of motion for small oscillations near the downward vertical are 


46 +6+4n76 =0, 


6+6+n'¢ =0, 


where @ and @ are the angles that the two strings make with the downward vertical, and n? = 
g/a. Find the normal frequencies and the forms of the normal modes for this system. 


Chapter Six 


Energy conservation 


KEY FEATURES 
The key features of this chapter are the energy principle for a particle, conservative fields of 
force, potential energies and energy conservation. 


In this Chapter, we introduce the notion of mechanical energy and its conservation. 
Although energy methods are never indispensible* for the solution of problems, they do give 
a greater insight and allow many problems to be solved in a quick and elegant manner. Energy 
has a fundamental réle in the Lagrangian and Hamiltonian formulations of mechanics. More 
generally, the notion of energy has been so widely extended that energy conservation has 
become the most pervasive and important principle in the whole of physics. 


6.1 THE ENERGY PRINCIPLE 


Suppose a particle P of mass m moves under the influence of a force F. Then its 
equation of motion is 


dv 
m— =F, 6.1 

rr (6.1) 
where v is the velocity of P at time rf. At this stage we place no restrictions on the force F. 
It may depend on the position of P, the velocity of P, the time, or anything else; if more than 
one force is acting on P, then F means the vector resultant of these forces. On taking the 
scalar product of both sides of equation (6.1) with v, we obtain the scalar equation 


dv 
mve—=F-v 
dt 


and, since 


* Energy is never mentioned in the work of Newton! 
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this can be written in the form 
dT 
—=. F . v, 6.2 
Th (6.2) 
where T = smv vu. 
Definition 6.1 Kinetic energy The scalar quantity T = smv -v= sm|v|* is called the 
kinetic energy of the particle P. 


If we now integrate equation (6.2) over the time interval [f;, t2], we obtain 


t2 
nn = | F.-vdt (6.3) 
ty 


where 7) and T> are the kinetic energies of P at times f; and f2 respectively. This is the energy 
principle for a particle moving under a force F. 


Definition 6.2 1-D work done The scalar quantity 
t2 
w= | F -vdt (6.4) 
ty 


is called the work done by the force F during the time interval [t,, t2]. The rate of working of 
F at time t is thus F + v. 
[The SI unit of work is the joule (J) and one joule per second is one watt (W).] 


Our result can now be stated as follows: 


Energy principle for a particle 


In any motion of a particle, the increase in the kinetic energy of the particle in a given 
time interval is equal to the total work done by the applied forces during this time inter- 
val. 


The energy principle is a scalar equality which is derived by integrating the vector equa- 
tion of motion (6.1). Thus the energy principle will generally contain less information than 
the equation of motion, so that we have no right to expect the motion of P to be determined 
from the energy principle alone. The situation is simpler when P has one degree of freedom, 
which means that the position of P can be specified by a single scalar variable. In this case 
the equation of motion and the energy principle are equivalent and the energy principle alone 
is sufficient to determine the motion. 


Example 6.1 Verify the energy principle 
A man of mass 100 kg can pull on a rope with a maximum force equal to two fifths 


of his own weight. [ Take g = 10 ms~?. ] In a competition, he must pull a block of 
mass 1600 kg across a smooth horizontal floor, the block being initially at rest. He is 
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able to apply his maximum force horizontally for 12 seconds before falling exhausted. 
Find the total work done by the man and confirm that the energy principle is true in 
this case. 


Solution 
In this problem, the block is subjected to three forces: the force exerted by the man, 
uniform gravity, and the vertical reaction of the smooth floor. However, since the last 
two of these are equal and opposite, they can be ignored. 

The man has weight 1000 N so that the force he applies to the block is a constant 
200 N. The Second Law then implies that, while the man is pulling on the rope, the 
block must have constant rectilinear acceleration 200/1600 = 1/8 ms~?. Since the 
block is initially at rest, its velocity v at time t is therefore v = t/8 ms~!. The total 
work W done by the man is then given by the formula (6.4) to be 


12 12 t 
w= Fevdr= | 200( 5) ar = 18003. 
0 0 8 


When t = 12 s, the block has velocity v = 12/8 = 3/2 ms™!, so that the final kinetic 
energy of the block is 5 (1600) (3 /2)* = 1800 J. Since the initial kinetic energy of the 
block is zero, the kinetic energy of the block increases by 1800 J, the same as the 
work done by the man. This confirms the truth of the energy principle. 


6.2 ENERGY CONSERVATION IN RECTILINEAR MOTION 


The energy principle is not normally used in the general form (6.3). When possible, it 
is transformed into a conservation principle. This is most easily illustrated by the special case 
of rectilinear motion. 

Suppose that the particle P moves along the x-axis under the force F acting in the positive 
x-direction. In this case, the ‘work done’ integral (6.4) reduces to 


t2 
w=f Fovdt, 
t 


where v = x is the velocity of P in the positive x-direction. For the case in which F is a force 
field (so that F = F(x)), the formula for W becomes 


to to dx x2 
w= Fudr= | Foy ar= f F(x)dx, 
al ty dt P| 


where xj = x(t,) and x2 = x(f2). Thus, when P moves over the interval [x1, x2] of the x-axis, 
the work done by the field F is given by 


v= [- F(x) dx (6.5) 
x 
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(This is a common definition of the work done by a force F.. It can be used when F = F(x), 
but not in general.) It follows that the energy principle for a particle moving in a rectilinear 
force field can be written 


x2 
m-n= f F(x)dx. 
xX 


Xx] 


Now let V (x) be the indefinite integral of — F(x), so that 


dV %2 
F=—-— and / F(x) dx = V(x) — V(x). (6.6) 


aI 
Such a V is called the potential energy* function of the force field F. In terms of V, the 
energy principle in rectilinear motion can be written 


T2 + V (x2) =T + V1), 


which is equivalent to the energy conservation formula 


T+iV=E (6.7) 


where F is a constant called the total energy of the particle. This result can be stated as 


follows: 


Energy conservation in rectilinear motion 


When a particle undergoes rectilinear motion in a force field, the sum of its kinetic and 
potential energies remains constant in the motion. 


Example 6.2 Finding potential energies 


Find the potential energies of (i) the (one-dimensional) SHM force field, (ii) the (one- 
dimensional) attractive inverse square force field. 


Solution 
(i) The one-dimensional SHM force field is F = —ax, where a is a positive constant. 
The corresponding V is given by 


v--f Fojds =a f x dx, 


where a, the lower limit of integration, can be arbitrarily chosen. (This corresponds 
to the arbitrary choice of the constant of integration.) Note that, by beginning the 


* The potential energy corresponding to a given F is uniquely determined apart from a constant of integra- 
tion; this constant has no physical significance. 
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integration at x = a, we make V (a) = 0. In the present case it is conventional to take 


a =0so that V = 0 at x = 0. With this choice, the potential energy is V = Fax? 


(ii) The one-dimensional attractive inverse square force field is F = —K /x?, where 
x > Oand K isa positive constant. The corresponding V is given by 


x x 1 
va-f Pode =K | <= ax. 
a a x 


This time it is not possible to take a = O (the integral would then be meaningless) 
and it is conventional to take a = +00; this makes V = 0 when x = +00. With this 
choice, the potential energy is V = —K/x (x > 0). & 


Example 6.3 Rectilinear motion under uniform gravity 
A particle P is projected vertically upwards with speed u and moves under uniform 


gravity. Find the maximum height achieved and the speed of P when it returns to its 
starting point. 


Solution 
Suppose that P is projected from the origin and moves along the z-axis, where Oz 
points vertically upwards. The force F exerted by uniform gravity is F = —mg and 


the corresponding potential energy V is given by 


V= [i mg) dz = mgz. 
0 


Energy conservation then implies that 


smu + mgz = E, 


where v = Z, and the constant E is determined from the initial condition v = u when 
z =0. This gives E = smu so that the energy conservation equation for the motion 
is 

smu +mgz= imu’. 
Since v = 0 when z = Zmax, it follows that zmax = U2 /(2g). This result was obtained 
from the Second Law in Chapter 4. When P returns to O, z = 0 and so |v| = u. Thus 


P returns to O with speed u, the projection speed. 


Example 6.4 Simple harmonic motion 


A particle of mass m is projected from the point x = a with speed u and moves along 
the x-axis under the SHM force field F = —m@x. Find the maximum distance from 
O and the maximum speed achieved by the particle in the subsequent motion. 


Solution 


The potential energy correspondingto the force field F = —mw* 


xisV = 50rx?, 
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Energy conservation then implies that 


tmv- + Smar x? =f. 


where v = xX, and the constant F is determined from the initial condition |v| = u 
when x = a. This gives E = sm (u* +w*a7) so that the energy conservation equation 
for the motion becomes 


vv? + wx? =u? + wa. 


Since v = 0 when |x| takes its maximum value, it follows that 


2 1/2 
u 2 
Ion = (45 +02) : 
(40) 


Also, since the left side of the energy conservation equation is the sum of two positive 
terms, it follows that |v| takes its maximum value when x = 0. Hence 


1/2 
|U|max = (« + wa’) . 


These results could also be obtained (less quickly) by using the methods described in 
Chapter 5. @ 


6.3 GENERAL FEATURES OF RECTILINEAR MOTION 


The energy conservation equation 
smv + V(x) =E (6.8) 


enables us to deduce the general features of rectilinear motion in a force field. Since T > 0 
(and is equal to zero only when v = 0) it follows that the position of the particle is restricted 
to those values of x that satisfy 


V(x) <E, 


and that equality will occur only when v = 0. Suppose that V (x) has the form shown in Figure 
6.1 and that E' has the value shown. Then the motion of P must take place either (i) in the 
bounded interval a < x < b, or (ii) in the unbounded interval c < x < oo. Thus, if the particle 
was situated in the interval [a, b] initially, this is the interval in which the motion will take 
place. 


Bounded motions 


Suppose that the motion is started with P in the interval [a, b] and with v positive, so that P is 
moving to the right. Then, since v can only be zero at x = a and x = 5, v will remain positive 
until P reaches the point x = b, where it comes to rest*. From equation (6.8), it follows that 


* Strictly speaking, we should exclude the possibility that P might approach the point x = b asymptotically 
as t — oo, and never actually get there. This can happen, but only in the case in which the line V = E 
is a tangent to the graph of V(x) at x = b. In the general case depicted in Figure 6.1, P does arrive at 
x = bina finite time. 
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bounded unbounded 


FIGURE 6.1 Bounded and unbounded motions in a rectilinear force 
field. 


the ODE that governs this ‘right’ part of the motion is 


dx 
— =+[2E - Vey”. 
dt 
At the point x = b, V’ > 0 which implies that F < 0. P therefore moves to the left and does 
not stop until it reaches the point x = a. The ODE that governs this ‘left’ part of the motion is 
dx 
— =-2(E —V(x))]!”. 
Ai [2( (x))] 
At the point x = a, V’ < 0, which implies that F > 0 and that P moves to the right once 
again. The result is that P performs periodic oscillations between the extreme points x = a 
and x = b. Since the ‘left’ and ‘right’ parts of the motion take equal times, the period t of 
these oscillations can be found by integrating either equation over the intervala < x < b. 
Each equation is a separable ODE and integration gives 


T =2[" as 
de DEVE? 


It should be noted that these oscillations are generally not simple harmonic. In particular, their 
period is amplitude dependent. 


Example 6.5 Periodic oscillations 


A particle P of mass 2 moves on the positive x-axis under the force field F = 
(4/x?) — 1. Initially P is released from rest at the point x = 4. Find the extreme 
points and the period of the motion. 


Solution 


The force field F has potential energy V = (4/x)-+-x, so that the energy conservation 
equation for P is 


5(2)u" + (4/x) +x =E, 


138 Chapter 6 Energy conservation 


FIGURE 6.2 Positions of stable and unstable equilibrium. 


where v = x and E is the total energy. The initial condition v = 0 when x = 4 gives 
E =5 so that 


va5— (4/x) — x. 


The extreme points of the motion occur when v = 0, that is, when x = | and x = 4. 
To find the period t of the oscillations, write v = dx /dt in the last equation and take 
square roots. This gives the separable ODEs 


dx _4 E = ese)”, 
dt x 


where the plus and minus signs refer to the motion of P in the positive and negative 
x-directions respectively. Integration of either equation gives 


4 . 1/2 
t= 2f a dx ~ 9.69. H 
t Lia == x) 


Unbounded motions 


Suppose now that the motion is started with P in the interval [c, oo) and with v negative, so 
that P is moving to the left. Then, since v can only be zero at x = c, v will remain negative 
until P reaches the point x = c, where it comes to rest. At the point x = c, V’ < 0 which 
implies that F > 0. P therefore moves to the right and continues to do so indefinitely. 


Stable equilibrium and small oscillations 


First, we define what we mean by an equilibrium position. 


Definition 6.3 Equilibrium The point A is said to be an equilibrium position of P if, 
when P is released from rest at A, P remains at A. 


In the case of rectilinear motion under a force field F(x), the point x = a will be an 
equilibrium position of P if (and only if) F(a) = 0, that is, if V’(a) = 0. It follows that 
the equilibrium positions of P are the stationary points of the potential energy function V (x). 
Consider the equilibrium positions shown in Figure 6.2. These occur at stationary points of V 
that are a minimum and a maximum respectively. Suppose that P is at rest at the minimum 
point x = a when it receives an impulse of magnitude J which gives it kinetic energy AE 
(= J?/2m). The total energy of P is now E + AE, and so P will oscillate in the interval 
[a — 5,,a + 52] shown. It is clear from Figure 6.2 that, as the magnitude of J (and therefore 


6.3 General features of rectilinear motion 139 


AE) tends to zero, the ‘amplitude’ 6 of the resulting motion (the larger of 5; and 52) also tends 
to zero. This is the definition of stable equilibrium. 


Definition 6.4 Stable equilibrium Suppose that a particle P is in equilibrium at the point 
A when it receives an impulse of magnitude J ; let 6 be the amplitude of the subsequent motion. 
If 5 — Oas J — 0, then the point A is said to be a position of stable equilibrium of P.. 


On the other hand, if P is at rest at the maximum point x = b when it receives an impulse 
of magnitude J, it is clear that the amplitude of the resulting motion does not tend to zero as 
J tends to zero, so that a maximum point of V (x) is not a position of stable equilibrium of P. 
The same applies to stationary inflection points. 


Equilibrium positions of a particle 


The stationary points of the potential energy V (x) are the equilibrium points of P and 
the minimum points of V (x) are the positions of stable equilibrium. If A is a position of 
stable equilibrium, then P can execute small-amplitude oscillations about A. 


Approximate equation of motion for small oscillations 


Suppose that the point x = a is a minimum point of the potential energy V (x). Then, when x 
is sufficiently close to a, we may approximate V (x) by the first three terms of its Taylor series 
in powers of the variable (x — a), as follows: 


V(x) = V(a)+ (x —a)V'(a) + 5 (x _ a)’V"(a) 
= V(a) + 5(x —a)’V"(a), (6.9) 


since V'(a) = 0. Thus, for small amplitude oscillations about x = a, the energy conservation 
equation is approximately 


smu~ + V(a) + 5(x —a)°V"(a) = E. 


If we now differentiate this equation with respect to ¢ (and divide by v), we obtain the approx- 
imate (linearised) equation of motion 


qs 
m—> +V"(a) (x —a) =0. 
Provided that V’’(a) > 0, this is the equation for simple harmonic oscillations with angular 
frequency (V"(a)/m)!/* about the point x = a. The small oscillations of P about x = a are 


therefore approximately simple harmonic with approximate period t = 22 (m/V"(a))!/?. 


Example 6.6 Finding the period of small oscillations 


A particle P of mass 8 moves on the x-axis under the force field whose potential 
energy is 


y a x= 3? 
3 
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FIGURE 6.3 Particle P is in general motion 
under the force F’. The arc C is the path taken t=, A 
by P between the points A and B. 


Show that there is a single position of stable equilibrium and find the approximate 
period of small oscillations about this point. 


Solution 

For this V, V! = x? —4x +3 and V” = 2x —4. The equilibrium positions occur when 
V’ = 0, that is when x = 1 and x = 3. Since V"(1) = —2 and V’"(3) = 2, we deduce 
that the only position of stable equilibrium is at x = 3. The approximate period t of 
small oscillations about this point is therefore given by t = 27 (8/V"(3))!/? = 47. & 


6.4 ENERGY CONSERVATION IN A CONSERVATIVE FIELD 


Suppose now that the particle P is in general three-dimensional motion under the 
force F and that, in the time interval [t,4, tg], P moves from the point A to the point B along 
the path C, as shown in Figure 6.3. Then, by the energy principle (6.3), 


tB 
Ta -Ta= [ F-vdt, (6.10) 
tA 

where T'4 and Tz are the kinetic energies of P when t = ft, and t = tg respectively. When 
F is a force field F(r), the ‘work done’ integral on the right side of equation (6.10) can be 


written in the form 
tA tB d 
/ F-vat= [ Fy Lat = [Re -ar, 
tA tA dt c 


where C is the path taken by P in the time interval [t,4, tg]. It follows that the energy principle 
for a particle moving in a 3D force field can be written 


Tg —Ta= i. F(r) «dr. (6.11) 
€ 


Integrals like that on the right side of equation (6.11) are called line integrals. They differ 
from ordinary integrals in that the range of integration is not an interval of the x-axis, but a 
path in three-dimensional space. Line integrals are treated in detail in texts on vector field 
theory (see for example Schey [11]), but their physical meaning in the present context is clear 
enough. The quantity F - dr is the infinitesimal work done by F when P traverses the element 
dr of the path C. The line integral sums these contributions to give the total work done by F. 
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The line intgral of F along C is taken to be the definition of the work done by the force 
field F(r) when its point of application moves along any path C that connects A and B. 


Definition 6.5 3-D work done The expression 
WlA— B:c)= | Fi) -ar (6.12) 
é 


is called the work done by the force field F (r) when its point of application moves from A to 
B along any path C. 


The above definition is more than just an alternative definition of the work done by a force field acting 
on a particle. It defines the quantity W[ A — B; C] whether or not C is an actual path traversed by the 
particle P.. In this wider sense, the concept of ‘work done’ is purely notional, as is the concept of the ‘point 
of application of the force’. In this sense, W exists for a// paths joining the points A and B, but W should 
be regarded as the real work done by F only when C is an actual path of a particle moving under the field 
F(r). 


Energy conservation 


In order to develop an energy conservation principle for the general three-dimensional 
case, we need the right side of equation (6.11) to be expressible in the form 


[Fe -ar=va-Vo. (6.13) 
c 


for some scalar function of position V(r), where V4 and Vg are the values of V at the 
points A and B. In the rectilinear case, there was no difficulty in finding such a V (it was 
the indefinite integral of —F(x)). In the general case however, it is far from clear that any 
such V should exist. For, if there does exist a function V (r) satisfying equation (6.13), 
this must mean that the line integral W[ A — B; C] has the same value for all paths 
C that connect the points A and B. There is no reason why this should be true and, in 
general, it is not true. There is however an important class of fields F (7) for which V (7) 
does exist, and it is these fields that we shall consider from now on. 


Definition 6.6 Conservative field If the field F (r) can be expressed in the form* 
F = — grad V, (6.14) 


where V (r) is a scalar function of position, then F is said to be a conservative field and 
the function V is said to be the potential energy function’ for F.. 


* Tf w(r) is a scalar field then grad y is the vector field defined by 


ow, ow, ow 
eradw = k. 
eS ag ay ae 


Thus if y~ = xy, then grad yy = y>2i + 3xy22> j + 5xy zk. We could omit the minus sign in the 
definition (6.14), but the potential energy of F would then be —V instead of V. 

+ Tf V exists, then it is unique apart from a constant of integration. As in the rectilinear case, this constant 
has no physical significance. 
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Example 6.7 Conservative or not conservative? 


Show that the field F,; = —2xi — 2y Jj — 2zk is conservative but that the field F2 = 
yi — xf is not. 


Solution 
(i) If F, is conservative, then its potential energy V must satisfy 


and these equations integrate to give 
cag aye = 52 . 
V=x + p(y, Z), V=y + q(x, Z), VYS2 +r(x,y), 


where p,g andr are ‘constants’ of integration, which, in this case, are functions of the 
other variables. If V really exists, then these three representations of V can be made 
identical by making special choices of the functions p, g andr. In this example it is 
clear that this can be achieved by taking p = y* + 27,g =x*+2*,andr = x?+4+ y’. 
Hence F; = — grad (x? + y? +z?) and so F| is conservative. 


(11) If Fz is conservative, then its potential energy V must satisfy 


OV OV oV 


There is no V that satisfies these equations simultaneously. The easiest way to show 
this is to observe that, from the first equation, 9?7V/dydx = —1 while, from the 
second equation, 8°V /8xdy = +1. Since these mixed partial derivatives of V should 
be equal, we have a contradiction. The conclusion is that no such V exists and that 
F > is not conservative. 


Suppose now that the field F(r) is conservative with potential energy V(r) and let C 


be any path connecting the points A and B. Then 


wiA> 8: CI= | Fo)-dr= f (ena) ar 
Cc Cc 


aV aV aV 
= / i+ J+ k)-(dxi+dyj+dzk) 
c \ Ox dy Oz 


Va Maye My 
=— | —dx+— —dz 
c Ox ay | Oz 


- | av =Va-Vo. (6.15) 
C 


Thus, when F is conservative with potential energy V, 


[Po -ar=va—Vo. 
Cc 
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for any path C connecting the points A and B. The energy principle (6.11) can therefore 
be written 


Tg + Veg =Ta+Va 


which is equivalent to the energy conservation formula 


T+V=E (6.16) 


Our result can be summarised as follows: 


Energy conservation in 3-D motion 


When a particle moves in a conservative force field, the sum of its kinetic and 
potential energies remains constant in the motion. 


The condition that F be conservative seems restrictive, but most force fields encoun- 
tered in mechanics actually are conservative! 


Example 6.8 Finding 3-D potential energies 


(a) Show that the uniform gravity field F = —mgk is conservative with potential 
energy V = mgz. 


(b) Show that any force field of the form 
F=h(r)fF 


(a central field) is conservative with potential energy V = —H(r), where H(r) is the 
indefinite integral of h(). Use this result to find the potential energies of (i) the 3-D 
SHM field F = —ar 7, and (ii) the attractive inverse square field F = —(K/r?)7, 
where a and K are positive constants. 


Solution 
Since the potential energies are given, it is sufficient to evaluate — grad V in each case 
and show that this gives the appropriate F’. Case (a) is immediate. In case (b), 


dH (r) = dH or 
dx —sdr ‘Ox 


ee x 
=H'(r)= =hr)=, 
r r 
since r = (x? + y? + yi? and H’(r) = h(r). Thus 
— grad [—H(r)] = h(r) (= ae Boal k) =n) =A), 
r r r r 


as required. 
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In particular then, the potential energy of the SHM field F = —arris V = 
sar’, and the potential energy of the attractive inverse square field F = —(K/r?)? 
isV =—K/r.@ 


Example 6.9 Projectile motion 


A body is projected from the ground with speed u and lands on the flat roof of a build- 
ing of height h. Find the speed with which the projectile lands. [Assume uniform 
gravity and no air resistance. ] 


Solution 


Since uniform gravity is a conservative field with potential energy mgz, energy con- 
servation applies in the form 


Sm|v|* +mgz =, 


where O is the initial position of the projectile and Oz points vertically upwards. 
From the initial conditions, E = smu. Hence, when the body lands, 


2 


p24 mgh = smu ; 


1 
zm|v 


where v¢ is the landing velocity. The landing speed is therefore 
1/2 
| v/ [= CG _ 2gh) : 


Thus, energy conservation determines the speed of the body on landing, but not its 
velocity. @ 


Example 6.10 Escape from the Moon 


A body is projected from the surface of the Moon with speed u in any direction. Show 
that the body cannot escape from the Moon if u2 <2MG/R, where M and R are the 
mass and radius of the Moon. [Assume that the Moon is spherically symmetric.] 


Solution 


If the Moon is spherically symmetric, then the force F' that it exerts on the body is 
given by F = —(mMG/r?)r, where m is the mass of the body, and r is the position 
vector of the body relative to an origin at the centre of the Moon. This force is a 
conservative field with potential energy V = —mMG/r. Hence energy conservation 
applies in the form 


and, from the initial conditions, E = smu —(mMG)/R. Thus the energy conserva- 
tion equation is 


1 1 
|v|> =u? +2MG (=-): 
r R 
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6.5 


Since the left side of the above equation is positive, the values of r that occur in the 
motion must satisfy the inequality 


2 1 1 
uu +2MG({--—-—]>0. 
rR 
If the body is to escape, this inequality must hold for arbitrarily large r. This means 
that the condition 


2MG 
vs S0 


is necessary for escape. Hence if u2 < 2MG/R, the body cannot escape. The inter- 
esting feature here is that the ‘escape speed’ is the same for all directions of projection 
from the surface of the Moon. (The special case in which the body is projected verti- 
cally upwards was solved in Chapter 4.) 


Example 6.11 Stability of equilibrium in a 3-D conservative field 


A particle P of mass m can move under the gravitational attraction of two particles, 
of equal mass M, fixed at the points (0, 0, +a). Show that the origin O is a position 
of equilibrium, but that it is not stable. [This illustrates the general result that no 
free-space static gravitational field can provide a position of stable equilibrium. ] 


Solution 
When P is at O, the fixed particles exert equal and opposite forces so that the total 
force on P is zero. The origin is therefore an equilibrium position for P. 

Just as in rectilinear motion, O will be a position of stable equilibrium if the 
potential energy function V (x, y, z) has a minimum at O. This means that the value 
of V at O must be less than its values at a// nearby points. But at points on the z-axis 
between z = —a andz =a 

mMG mMG 2amMG 


V(0,0,z) = = ; 
( ) a-z at+z a* — 2? 


which has a maximum at z = 0. Hence the equilibrium at O is unstable to disturbances 
in the z-direction. Hl 


ENERGY CONSERVATION IN CONSTRAINED MOTION 


Some of the most useful applications of energy conservation occur when the mov- 


ing particle is subject to geometrical constraints, such as being connected to a fixed point 


by a light inextensible string, or being required to remain in contact with a fixed rigid 


surface (see section 4.2). Since constraint forces are not known beforehand one may won- 


der how to find the work that they do. The answer is that, in the idealised problems that 


we study, the work done by the constraint forces is often zero. In these cases the con- 


straint forces make no contribution to the energy principle and they can be disregarded. 


Situations in which constraint forces do no work include: 
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R 


FIGURE 6.4 The particle P slides over the 
fixed smooth surface S. The reaction R is 
normal to S and hence perpendicular to v, the 
velocity of P. 


Some constraint forces that do no work 


e A particle connected to a fixed point by a light inextensible string; here the string 
tension does no work. 

e A particle sliding along a smooth fixed wire; here the reaction of the wire does 
no work. 

e A particle sliding over a smooth fixed surface; here the reaction of the surface 
does no work. 


Consider for example the case of a particle P sliding over a smooth fixed surface S as 
shown in Figure 6.4. Because S is smooth, any reaction force R that it exerts must always 
be normal to S. But, since P remains on S, its velocity v must always be tangential to S. 
Hence R is always perpendicular to v so that R + v = 0. Thus the rate of working of R 
is zero and so R makes no contribution to the energy principle. Very similar arguments 
apply to the other two cases. 

We may now extend the use of conservation of energy as follows: 


Energy conservation in constrained motion 


When a particle moves in a conservative force field and is subject to constraint 
forces that do no work, the sum of its kinetic and potential energies remains con- 
stant in the motion. 


Example 6.12 The snowboarder 


A snowboarder starts from rest and descends a slope, losing 320 m of altitude in the 
process. What is her speed at the bottom? [Neglect all forms of resistance and take 
g=10ms~”.] 
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Solution 


The snowboarder moves under uniform gravity and the reaction force of the smooth 
hillside. Since this reaction force does no work, energy conservation applies in the 
form 


Sm|v|* +mgz = E, 


where m and v are the mass and velocity of the snowboarder, and z is the altitude of 
the snowboarder relative to the bottom of the hill. If the snowboarder starts from rest 
at altitude h, then E = 0+ megh. Hence, at the bottom of the hill where z = 0, her 
speed is 


|v| = (2gh)'”?, 


just as if she had fallen down a vertical hole! This speed evaluates to 80 ms~!, about 
180 mph. [At such speeds, air resistance would have an important influence.] 


Our next example concerns a particle constrained to move on a vertical circle. This is 
one of the classical applications of the energy conservation method. There are two distinct 
cases: (i) where the particle is constrained always to remain on the circle, or (ii) where 
the particle is constrained to remain on the circle only while the constraint force has a 
particular sign. 


Example 6.13 Motion in a vertical circle 


A fixed hollow sphere has centre O and a smooth inner surface of radius b. A particle 
P, which is inside the sphere, is projected horizontally with speed u from the lowest 
interior point (see Figure 6.5). Show that, in the subsequent motion, 


va 2gb(1 — cos@), 
provided that P remains in contact with the sphere. 


Solution 


While P remains in contact with the sphere, the motion is as shown in Figure 6.5. 
The forces acting on P are uniform gravity mg and the constraint force N, which 
is the normal reaction of the smooth sphere. Since N is always perpendicular to v 
(the circumferential velocity of P), it follows that N does no work. Hence energy 
conservation applies in the form 


2 


smu —mgbcosd= E, 


where m is the mass of P, and the zero level of the potential energy is the horizontal 
plane through O. Since v = u when 6 = 0, it follows that E = smu — mgb and the 
energy conservation equation becomes 


v? = u* — 2gb(1 — cos8), (6.17) 
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v=b0 


FIGURE 6.5 Particle P slides on the smooth 
inner surface of a fixed sphere. The motion 
takes place in a vertical plane through the 
centre O. 


as required. This gives the value of v as a function of @ while P remains in contact 
with the sphere. ml 


Question The reaction force 
Find the reaction force N as a function of 0. 


Answer 


Once the motion is determined (for example by equation (6.17)), the unknown con- 
straint forces may be found by using the Second Law in reverse. In the present case, 


—_ 
consider the component of the Second Law F = ma in the direction PO. This gives 
N —mgcos@ = mv~/b, 


where we have made use of the formula (2.17) for the acceleration of a particle in 
general circular motion. On using the formula for v* from equation (6.17), we obtain 


mu 
N= en eee ee (6.18) 


This gives the value of N as a function of 6 while P remains in contact with the 
sphere. 


Question Does P leave the surface of the sphere? 


For the particular case in which u = (3gb)!/?, show that P will leave the surface of 
the sphere, and find the value of @ at which it does so. 


Answer 
When u = (3¢b)!/?, the formulae (6.17), (6.18) for v* and N become 


v? = gh(1 + 2cos8), N =mg(1+3cos@). 


If P remains in contact with the sphere, then it comes to rest when v = 0, that is, 
when cos@ = —1/2. This first happens when 6 = 120° (a point on the upper half of 
the sphere, higher than O). If P were threaded on a circular wire (from which it could 
not fall off) this is exactly what would happen; P would perform periodic oscillations 
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in the range —120° < 0 < 120°. However, in the present case, the reaction N is 
restricted to be positive and this condition will be violated when 6 > cos! (—1/3) © 
109°. Since this angle is less than 120°, the conclusion is that P loses contact with 
the sphere when 9 = cos~!(—1/3); at this instant, the speed of P is (gb/3)!/*. P 
then moves as a free projectile until it strikes the sphere. Hl 


Question Complete circles 


How large must the initial speed be for P to perform complete circles? 


Answer 


For complete circles to be executed, it is necessary (and sufficient) that v > 0 and 
N > Oat all times, that is, 


ur > 2gb(1 — cos@) and ur > gb(2 — 3cos@) 


for all values of 6. For these inequalities to hold for all 0, the speed u must satisfy 
u> > 4gb and u* > Sgb respectively. Since the second of these conditions implies 
the first, it follows that P will execute complete circles if u? > 5gb. 


Example 6.14 Small oscillations in constrained motion 


A particle P of mass m can slide freely along a long straight wire. P is connected 
to a fixed point A, which is at a distance 4a from the wire, by a light elastic cord of 
natural length 3a and strength a. Find the approximate period of small oscillations of 
P about its equilibrium position. 


Solution 


Suppose P has displacement x from its equilibrium position. In this position, the 
length of the cord is (16a* + x)!/? and its potential energy V is 


1/2 2 
V= 5a G + i) aa 3a 
1/2 
= 5a 254? o> = 6a (160? + x) : 

The energy conservation equation for P is therefore 

1 2 A 2 2 2 a\ V7 

xzMx" + 5a 25a" + x — 6a (16a +x?) =F, 
which, on neglecting powers of x higher than the second, becomes 


132, 1 2 Z 
5Mx 5 SS ee oe 
5mx + 5 E + | 


On differentiating this equation with respect to t, we obtain the approximate lin- 
earised equation of motion 


eae8 0 
MX —x =U. 
4 
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This is the SHM equation with w? = a/4m. It follows that the approximate period 
of small oscillations about x = 0 is 477(m/a)!/?. m 


Energy conservation from a physical viewpoint 


Suppose that a particle P of mass m can move on the x-axis and is connected to a fixed 
post at x = —a by a light elastic spring of natural length a and strength a. Then the force 
F (x) exerted on P by the spring is given by F = —ax, where x is the displacement of P 
in the positive x-direction. This force field has potential energy V = sax? and the energy 
conservation equation for P takes the form 


smu + sax? = E, (6.19) 


where v = x. Here, the spring is regarded merely as an agency that supplies a force field with 
potential energy V = ax? However, there is a much more satisfying interpretation of the 
energy conservation principle (6.19) that can be made. 

To see this we consider the spring as described above, but now with no particle attached 
to its free end. Suppose that the spring is in equilibrium (with its free end at x = 0) when an 
external force G(t) is applied there. This force is initially zero and increases so that, at any 
time ¢, the spring has extension X = G(t)/a. Suppose that this process continues until the 
spring has extension A. Then the total work done by the force G(t) in producing this extension 


is given by 


is . T dX A 
i Ginkar= | ax at = f aX dX = xa A’. 
0 0 dt 0 


Since the force exerted by the fixed post does no work, the total work done by the external 
forces in producing the extension A is 5a A*. Suppose now that the spring is ‘frozen’ in its 
extended state (by being propped open, for example) while the particle P is connected to the 
free end. The system is then released from rest. The energy conservation equation for P is 
given by equation (6.19), where, from the initial condition v = 0 when x = A, the total energy 
E= 5a A?, This gives 
smu + sax? — sa, (6.20) 
Thus, the total energy in the subsequent motion is equal to the original work done in stretching 
the spring. The natural physical interpretation of this is that the spring is able to store the work 
that is done upon it as internal energy. Then, when the particle is connected and the system 
released, this stored energy is available to be transferred to the particle in the form of kinetic 
energy. Equation (6.20) can thus be interpreted as an energy conservation principle for the 
particle and spring together, as follows: Jn any motion of the particle and spring, the sum 
of the kinetic energy of the particle and the internal energy of the spring remains constant. In 
this interpretation, the particle has no potential energy; instead, the spring has internal energy. 
In the above example, the particle and the spring can pass energy to each other, but the 
total of the two energies is conserved. This is the essential nature of energy. It is an entity that 
can appear in different forms but whose total is always conserved. Energy is probably the 
most important notion in the whole of physics. However, it should be remembered that, in the 
context of mechanics, it is not usual to take account of forms of energy such as heat or light. 
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As aresult, we will find situations (inelastic collisions, for example) in which energy seems to 


disappear. There is no contradiction in this; the energy has simply been transferred into forms 
that we choose not to recognise. 


Problems on Chapter 6 


Answers and comments are at the end of the book. 


Harder problems carry a star (*). 


Unconstrained motion 


6.1 A particle P of mass 4 kg moves under the action of the force F = 4i + 12t7 j N, where 
t is the time in seconds. The initial velocity of the particle is 2i + j + 2k ms7!. Find the 
work done by F, and the increase in kinetic energy of P, during the time interval 0 < t < 1. 
What principle does this illustrate? 


6.2 In a competition, a man pushes a block of mass 50 kg with constant speed 2 ms~! up a 
smooth plane inclined at 30° to the horizontal. Find the rate of working of the man. [Take 
g=10ms~?.] 


6.3 An athlete putts a shot of mass 7 kg a distance of 20 m. Show that the athlete must do to 
at least 700 J of work to achieve this. [ Ignore the height of the athlete and take g = 10 ms~?.] 


6.4 Find the work needed to lift a satellite of mass 200 kg to a height of 2000 km above the 
Earth’s surface. [Take the Earth to be spherically symmetric and of radius 6400 km. Take the 
surface value of g to be 9.8 ms~.] 


6.5 A particle P of unit mass moves on the positive x-axis under the force field 


369 


x 
Show that each motion of P consists of either (i) a periodic oscillation between two extreme 
points, or (ii) an unbounded motion with one extreme point, depending upon the value of the 
total energy. Initially P is projected from the point x = 4 with speed 0.5. Show that P 
oscillates between two extreme points and find the period of the motion. [You may make use 
of the formula 


im xdx _ m(a+b) 1 
a [x-a)(b—-x)/2 0 2 


Show that there is a single equilibrium position for P and that it is stable. Find the period 
of small oscillations about this point. 


6.6 A particle P of mass m moves on the x-axis under the force field with potential energy 
V = Vox /b)*, where Vo and b are positive constants. Show that any motion of P consists 
of a periodic oscillation with centre at the origin. Show further that, when the oscillation has 
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amplitude a, the period t is given by 


mye pe 
= 202(T) a a= E12" 


[Thus, the larger the amplitude, the shorter the period! ] 


6.7 A particle P of mass m, which is on the negative x-axis, is moving towards the origin 
with constant speed u. When P reaches the origin, it experiences the force F = —K x”, where 
K is a positive constant. How far does P get along the positive x-axis? 


6.8 A particle P of mass m moves on the x-axis under the combined gravitational attraction 
of two particles, each of mass M, fixed at the points (0, a, 0) respectively (see Figure 3.3). 
Example 3.4 shows that the force field F (x) acting on P is given by 


= 2nmMGx 
— (a2 + x2)3/2° 
Find the corresponding potential energy V (x). 


Initially P is released from rest at the point x = 3a/4. Find the maximum speed achieved 
by P in the subsequent motion. 


6.9 A particle P of mass m moves on the axis Oz under the gravitational attraction of a 
uniform circular disk of mass M and radius a as shown in Figure 3.6. Example 3.6 shows that 
the force field F(z) acting on P is given by 


2mMG 1 Zz 
~ @ (a2 + 22)1/72 
Find the corresponding potential energy V (z) for z > 0. 


Initially P is released from rest at the point z = 4a/3. Find the speed of P when it hits 
the disk. 


(z > 0). 


6.10 A catapult is made by connecting a light elastic cord of natural length 2a and strength a 
between two fixed supports, which are distance 2a apart. A stone of mass m is placed at the 
center of the cord, which is pulled back a distance 3a/4 and then released from rest. Find the 
speed with which the stone is projected by the catapult. 


6.11 A light spring of natural length a is placed on a horizontal floor in the upright position. 
When a block of mass M is resting in equilibrium on top of the spring, the compression of the 
spring is a/15. The block is now lifted to a height 3a/2 above the floor and released from rest. 
Find the compression of the spring when the block first comes to rest. 


6.12 A particle P carries a charge e and moves under the influence of the static magnetic field 
B(r) which exerts the force F = ev x B on P, where v is the velocity of P. Show that P 
travels with constant speed. 


6.13% A mortar shell is to be fired from level ground so as to clear a flat topped building of 
height / and width a. The mortar gun can be placed anywhere on the ground and can have 
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any angle of elevation. What is the least projection speed that will allow the shell to clear the 
building? [Hint How is the reqired minimum projection speed changed if the mortar is raised 
to rooftop level?] 

For the special case in which h = 5a, find the optimum position for the mortar and the 
optimum elevation angle to clear the building. 


If you are a star at electrostatics, try the following two problems: 


6.14% An earthed conducting sphere of radius a is fixed in space, and a particle P, of mass 
m and charge gq, can move freely outside the sphere. Initially P is a distance b (> a) from the 
centre O of the sphere when it is projected directly away from O. What must the projection 
speed be for P to escape to infinity? [Ignore electrodynamic effects. Use the method of images 
to solve the electrostatic problem.] 


6.15% An uncharged conducting sphere of radius a is fixed in space and a particle P, of mass 
m and charge gq, can move freely outside the sphere. Initially P is a distance b (> a) from the 
centre O of the sphere when it is projected directly away from O. What must the projection 
speed be for P to escape to infinity? [Ignore electrodynamic effects. Use the method of images 
to solve the electrostatic problem. ] 


Constrained motion 


6.16 A bead of mass m can slide on a smooth circular wire of radius a, which is fixed in 
a vertical plane. The bead is connected to the highest point of the wire by a light spring of 
natural length 3a/2 and strength 4. Determine the stability of the equilibrium position at the 
lowest point of the wire in the cases (i) @ = 2mg/a, and (ii) A = Smga. 


6.17 A smooth wire has the form of the helix x = acos6@, y = asin@, z = b0, where 6 is 
a real parameter, and a, b are positive constants. The wire is fixed with the axis Oz pointing 
vertically upwards. A particle P, which can slide freely on the wire, is released from rest at the 
point (a, 0, 27b). Find the speed of P when it reaches the point (a, 0, 0) and the time taken 
for it to do so. 


6.18 A smooth wire has the form of the parabola z = x?/2b, y = 0, where b is a positive 
constant. The wire is fixed with the axis Oz pointing vertically upwards. A particle P, which 
can slide freely on the wire, is performing oscillations with x in the range —a < x < a. Show 
that the period t of these oscillations is given by 


4 a b? + x? 1/2 
t= dx. 
aoe f (3 ) 


By making the substitution x = a sin y in the above integral, obtain a new formula for t. Use 
this formula to find a two-term approximation to T, valid when the ratio a/b is small. 


6.19% A smooth wire has the form of the cycloid x = c(@ + sin@), y = 0,z =c(1 —cos@), 
where c is a positive constant and the parameter 6 lies in the range —m7 < 0 < x. The wire 
is fixed with the axis Oz pointing vertically upwards. [Make a sketch of the wire.] A particle 
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initial position @~-~ 


FIGURE 6.6 The swing of the pendulum is obstructed by a fixed peg. 


can slide freely on the wire. Show that the energy conservation equation 1s 
(1 + cos 0) 6? + ag — cos @) = constant. 
c 


A new parameter u is defined by u = sin 50. Show that, in terms of u, the equation of motion 
for the particle is 
i § ) 
u —)u. 
= be 


Deduce that the particle performs oscillations with period 4 (c/g)!/*, independent of the 
amplitude! 


6.20 A smooth horizontal table has a vertical post fixed to it which has the form of a circular 
cylinder of radius a. A light inextensible string is wound around the base of the post (so that 
it does not slip) and its free end of the string is attached to a particle that can slide on the 
table. Initially the unwound part of the string is taut and of length 4a/3. The particle is then 
projected horizontally at right angles to the string so that the string winds itself on to the post. 
How long does it take for the particle to hit the post? [You may make use of the formula 


[a + ¢7)'Pdb = 361 + 97)? + 5 sinh! o. ] 


6.21 A heavy ball is suspended from a fixed point by a light inextensible string of length 
b. The ball is at rest in the equilibrium position when it is projected horizontally with speed 
(7gb/2)'/?. Find the angle that the string makes with the upward vertical when the ball begins 
to leave its circular path. Show that, in the subsequent projectile motion, the ball returns to its 
starting point. 


6.22% A new avant garde mathematics building has a highly polished outer surface in the 
shape of a huge hemisphere of radius 40 m. The Head of Department, Prof. Oldfart, has his 
student, Vita Youngblood, hauled to the summit (to be photographed for publicity purposes) 
but a small gust of wind causes Vita to begin to slide down. Oldfart’s displeasure is increased 
when Vita lands on (and severely damages) his car which is parked nearby. How far from the 
outer edge of the building did Oldfart park his car? Did he get what he deserved? (Happily, 
Vita escaped injury and found a new supervisor.) 


6.23% * A heavy ball is attached to a fixed point O by a light inextensible string of length 2a. 
The ball is drawn back until the string makes an acute angle a with the downward vertical and 
is then released from rest. A thin peg is fixed a distance a vertically below O in the path of 
the string, as shown in Figure 6.6. In a game of skill, the contestant chooses the value of a and 
wins a prize if the ball strikes the peg. Show that the winning value of a is approximately 86°. 


Chapter Seven 


Orbits in a central field 


including Rutherford scattering 


KEY FEATURES 

For motion in general central force fields, the key results are the radial motion equation and 
the path equation. For motion in the inverse square force field, the key formulae are the 
E-formula, the L-formula and the period formula. 


The theory of orbits has a special place in classical mechanics for it was the desire 
to understand why the planets move as they do which provided the major stimulus in 
the development of mechanics as a scientific discipline. Early in the seventeenth cen- 
tury, Johannes Kepler * published his ‘laws of planetary motion’, which he deduced by 
analysing the accurate experimental observations made by the astronomer Tycho Brahe." 


The German mathematician and astronomer Johannes Kepler (1571-1630) was a firm believer in the 
Copernican (heliocentric) model of the solar system. In 1596 he became mathematical assistant to Tycho 
Brahe, the foremost observational astronomer of the day, and began working on the intractable problem 
of the orbit of Mars. This work continued after Tycho’s death in 1601 and, after much labour, Kepler 
showed that Tycho’s observations of Mars corresponded very precisely to an elliptic orbit with the Sun at 
a focus. This result, together with the ‘law of areas’ (the second law) was published in 1609. Kepler then 
found similar orbits for other planets and his third law was published in 1619. 

Tycho Brahe (1546-1601) was a Danish nobleman. He had a lifelong interest in observational astronomy 
and developed a succession of new and more accurate instruments. The King of Denmark gave him 
money to create an observatory and also the island of Hven on which to build it. It was here that Tycho 
made his accurate observations of the planets from which Kepler was able to deduce his laws of planetary 
motion. Tycho’s other claim to fame is that he had a metal nose. When the original was cut off in a duel, 
he had an artificial nose made from an alloy of silver and gold. Tycho is perhaps better remembered for 
his nose job than he is for a lifetime of observations. 


=b 
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FIGURE 7.1 Each planet P moves on an 
elliptical path with the Sun S at one focus. 
The area A is that referred to in Kepler’s 
second law. 


Kepler’s laws of planetary motion 


First law Each of the planets moves on an elliptical path with the Sun at one focus 
of the ellipse. 


Second law For each of the planets, the straight line connecting the planet to the 
Sun sweeps out equal areas in equal times. 


Third law The squares of the periods of the planets are proportional to the cubes of 
the major axes of their orbits. 


The problem of determining the law of force that causes the motions described by 
Kepler (and proving that it does so) was the most important scientific problem of the sev- 
enteenth century. In what must be the finest achievement in the whole history of science, 
Newton’s publication of Principia in 1687 not only proved that the inverse square law of 
gravitation implies Kepler’s laws, but also laid down the entire framework of the science 
of mechanics. Orbit theory is just as important today, the principal fields of application 
being astronomy, particle scattering and space travel. 

In this chapter, we treat the problem of a particle moving in a central force field with a 
fixed centre; this is called the one-body problem. The assumption that the centre of force 
is fixed is an accurate approximation in the context of planetary orbits. The combined 
mass of all the planets, moons and asteroids is less than 0.2% of the mass of the Sun. We 
therefore expect the motion of the Sun to be comparatively small, as are inter-planetary 
influences.* However, we do not confine our interest to motion under the attractive inverse 
square field. At first, we consider motion in any central force field with a fixed centre. This 
part of the theory will then apply not only to gravitating bodies, but also (for example) to 
the scattering of neutrons. The important cases of inverse square attraction and repulsion 
are then examined in greater detail. 


* The more general two-body problem is treated in Chapter 10. The two-body theory must be used to 
analyse problems in which the masses of the two interacting bodies are comparable, as they are in binary 
stars. 
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FIGURE 7.2 Each orbit of a particle P in a central force field with centre O 
takes place in a plane through O. The position of P in the plane of motion is 
specified by polar coordinates r, 6 with centre at O. 


7.1. THE ONE-BODY PROBLEM — NEWTON’S EQUATIONS 


First we define what we mean by a central force field. 


Definition 7.1 Central field A force field F(r) is said to be a central field with centre 
O if it has the form 


F(r) = F(r)?, 


where r = |r| and = r/r. A central field is thus spherically symmetric about its centre. 


A good example of a central force is the gravitational force exerted by a fixed point 
mass. Suppose P has mass m and moves under the gravitational attraction of a point mass 
M fixed at the origin. In this case, the force acting on P is given by the law of gravitation 
to be 


where G is the constant of gravitation. This is a central field with 


mMG 


Fr) =-—-. 


Each orbit lies in a plane through the centre of force 


The first thing to observe is that, when a particle P moves in a central field with centre 
O, each orbit of P takes place in a plane through O, as shown in Figure 7.2. This is the 
plane that contains O and the initial position and velocity of P. One may give a vectorial 
proof of this, but it is quite clear on symmetry grounds that P will never leave this plane. 
Each motion is therefore two-dimensional and we take polar coordinates r, 6 (centred on 
O) to specify the position of P in the plane of motion. On using the formulae (2.14) for 
the components of acceleration in polar coordinates, the Newton equations of motion for 
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FIGURE 7.3 The angular momentum mr26 = mp v, where 
v= vj. 


P become 


i (i = 6?) = F(r), (7.1) 
m (ré + 276) = 0. (7.2) 


Angular momentum conservation 

Equation (7.2) can be written in the form 
a ae 
—— (mr?6) =0, 
aN” 


which can be integrated with respect to f to give 
mr76 = constant. 


The quantity mr?6, which is a constant of the motion, is called the angular momentum* 
of P. The general theory of angular momentum (and its conservation) is described in 
Chapter 11, but for now it is sufficient to regard ‘angular momentum’ simply as a name that 
we give to the conserved quantity mr. This angular momentum has a simple kinematical 
interpretation. From Figure 7.3 it follows that 


mr?6 = mr (r6) = mrve = m(r cos @) ( ) 
COs a 
=mpv, 


where p is the perpendicular distance of O from the tangent to the path of P, and v = 
|v|. This formula provides the usual way of calculating the constant value the angular 
momentum from the initial conditions. 


* More precisely, it is the angular momentum of the particle about the axis {O, k}, where the unit vector k 
is perpendicular to the plane of motion (see Figure 7.2). The angular momentum of P about the point O 
is the vector quantity mrxv, but the axial angular momentum used in the present chapter is the component 
of this vector in the k-direction. 
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Newton equations in specific form 


It is usual and convenient to eliminate the mass m from the theory. If we write 
Fir) =mf(r), 


where f(r) is the outward force per unit mass, and let L (= 20) be the angular momen- 
tum per unit mass then the Newton equations (7.1), (7.2) reduce to the specific form 


i —ré? = f(r), (7.3) 
Peat, (7.4) 
where L is a constant.* Note that these equations apply to orbits in any central field. The 


second of these equations appears throughout this chapter and we will call it the angular 
momentum equation. 


Angular momentum equation 
; (3) 
r76=L 


Kepler’s second law 


Angular momentum conservation is equivalent to Kepler’s second law. The area A shown 
in Figure 7.1 can be expressed (with an obvious choice of initial line) as 


0 
A=5 / r dé. 
0 
Then, by the chain rule, 


dA dA _ dé 
= x 
dt dé dt 


_ 1,24 _ 1 
= 5) 06=35L, 


where L is the constant value of the angular momentum. Thus A increases at a constant 
rate, which is what Kepler’s second law says. Thus Kepler’s second law holds for ail 
central force fields, not just the inverse square law. 


7.2 | GENERAL NATURE OF ORBITAL MOTION 


In our first method of solution, we take as our starting point the principles of 
conservation of angular momentum and energy. 


* Without losing generality, we will take L to be positive, that is, we suppose 0 is increasing with time. 
(The special case in which L = 0 corresponds to rectilinear motion through O.) 
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Energy conservation 


Every central field F = mf (r)r is conservative with potential energy mV (r), where 


dV 
to ers, (7.6) 


Energy conservation then implies that 
TI+V=E, 


where T is the specific kinetic energy, V is the specific potential energy, and the constant 
E is the specific total energy. On replacing T by its expression in polar coordinates, we 
obtain 


Energy equation 
‘ds : (7.7) 
5 (7? + (OP) + V0) =E 


as the energy conservation equation. The conservation equations (7.5), (7.7) are equiva- 
lent to the Newton equations (7.1), (7.2) and are a convenient starting point for investigat- 
ing the general nature of orbital motion. 


The radial motion equation 


From the angular momentum conservation equation (7.5), we have 
6=L/r* 


and, on eliminating 6 from the energy conservation equation (7.7), we obtain 


1 32 L? 
DP +VON+s5=E, (7.8) 


an ODE for the radial distance r(t). We call this the radial motion equation for the 
particle P. Equation (7.8) (together with the initial conditions) is sufficient to determine 
the variation of r with f, and the angular momentum equation (7.5) then determines the 
variation of 6 with ¢. Unfortunately, for most laws of force, this procedure cannot be 
carried through analytically. However, it is still possible to make important deductions 
about the general nature of the motion. 

Equation (7.8) can be written in the form 


sr 4+V*(r) =E, (7.9) 


where 


* L? 
V*(r)=V(r)+ rk (7.10) 
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unbounded 


FIGURE 7.4 The effective potential V* shown admits bounded and 
unbounded orbits, depending on the initial conditions. 


The function V*(r) is called the effective potential of the radial motion and its use 
reduces the radial motion of P to a rectilinear problem. It must be emphasised 
though that the whole motion is two-dimensional since @ is increasing in accordance 
with (7.5). 

Because r satisfies the radial motion equation (7.9), the variation of r with ¢ can be 
analysed by using the same methods as were used in Chapter 6 for rectilinear particle 
motion. In particular, the general nature of the motion depends on the shape of the graph 
of V* (which depends on L) and the value of E. The values of the constants L and E 
depend on the initial conditions. 

Suppose for example that the law of force and the initial conditions are such that V* 
has the form shown in Figure 7.4 and that E has the value shown. Then, since T > 0, it 
follows that the motion is restricted to those values r that satisfy the inequality 


V*() <E, 


with equality holding when + = 0. There are two possible motions, in each of which the 
variation of r with ¢ is governed by the radial motion equation (7.8). 


(i) a bounded motion in which r oscillates in the range [a, b]. In this motion, r(t) is 
a periodic function.* 


(ii) an unbounded motion in which r lies in the interval [c, oo). In this motion r is 
not periodic but decreases until the minimum value r = c is achieved and then 
increases without limit. 


The bounded orbit. A typical bounded orbit is shown in Figure 7.5 (left). The orbit 
alternately touches the inner and outer circles r = a andr = b, which corresponds to 
the radial coordinate 7 oscillating in the interval [a, b]. Without losing generality, suppose 
that P is at the point Bj when t = 0 and that OB, is the line 6 = 0. Consider the part of 


* The fact that r(t) is periodic does not mean that the whole motion must be periodic. 
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bounded orbit unbounded orbit 


FIGURE 7.5 Typical bounded and unbounded orbits. 


the orbit between A, and A2. It follows from the governing equations (7.8), (7.5) that r is 
an even function of t while @ is an odd function of t. This means that the segment By A2 of 
the orbit is just the reflection of the segment A,B, in the line O B,. This argument can be 
repeated to show that the segment A7Bz> is the reflection of the segment B, A2 in the line 
O Az, and so on. Thus the whole orbit can be constructed from a knowledge of a single 
segment such as A; By. 

It follows from what has been said that the angles AiO Bi, B 10. A2, A2 O Bo (and 
so on) are all equal. Let a be the common value of these angles. Then the orbit will 
eventually close itself if some integer multiple of a is equal to some whole number of 
complete revolutions, that is, if a@/z is a rational number. There is no reason to expect 
this condition to hold and, in general, it does not. It follows that these bounded orbits are 
not generally closed. The closed orbits associated with the attractive inverse square field 
are therefore exceptional, rather than typical! 


The unbounded orbit. In the unbounded case there are just two segments both of which 
are semi-infinite (see Figure 7.5 (right)). The segment in which P recedes from O is the 
reflection of the segment in which P approaches O in the line OC. 


Apses and apsidal distances 


The points at which an orbit touches its bounding circles are important and are given a 
special name: 


Definition 7.2 Apse, apsidal distance, apsidal angle A point of an orbit at which 
the distance OP. achieves its maximum or minimum value is called an apse of the orbit. 
These maximum and minimum distances are called the apsidal distances and the angu- 
lar displacement between successive apses (the angle a in Figure 7.5 (left)) is called the 
apsidal angle. 
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unbounded 


FIGURE 7.6 Left: The effective potential V* for the attractive inverse square force. Right: The 
path of the asteroid around the Sun. C is the point of closest approach. 


In the special case of orbits around the Sun, the point of closest approach is called the 


perihelion and the point of maximum distance the aphelion. The corresponding terms for 


orbits around the Earth are perigee and apogee. 


The apsidal distances, the maximum and minimum distances of P from O, are easily 


found from the radial motion equation (7.8). At an apse, 7 = 0 and sor must satisfy 


L2 
V(r —-&, FAl 
M+ (7.11) 


The positive roots of this equation are the apsidal distances. 


Example 7.1 Asteroid deflected by the Sun 


A particle P of mass m moves in the central force field —(my/r*)7, where y is a 
positive constant. Show that bounded and unbounded orbits are possible depending 
on the value of E. 

An asteroid is approaching the Sun from a great distance. At this time it has 
constant speed u and is moving in a straight line whose perpendicular distance from 
the Sun is p. Find the equation satisfied by the apsidal distances of the subsequent 
orbit. For the special case in which u* = 4MoG/3p (where Mo is the mass of the 
Sun), find (i) the distance of closest approach of the asteroid to the Sun, and (ii) the 
speed of the asteroid at the time of closest approach. 


Solution 
For this law of force, V = —y/r and the effective potential V* is 
2 
y= - a LY 
ro Ore 


This V* has the form shown in Figure 7.6 (left), from which it is clear that the orbit 
will be 


(i) bounded if E < 0, 
(ii) unbounded if E > 0, 


whatever the value of L. 
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In the asteroid example, the constant y = MoG, where Mo is the mass of the 
Sun and G is the constant of gravitation. With the given initial conditions, L = pu 


and E = u*/2, so that E > 0 and the orbit is unbounded. 
The equation (7.11) for the apsidal distances becomes 


that is, 


where y = MoG. 


For the special case in which u* = 4MoG/3p, this equation simplifies to 


Or? 4. 3pr— 2p = 0. 


The distance of closest approach of the asteroid is the positive root of this quadratic 


equation, namely r = p/2. 


The speed V of the asteroid at closest approach is easily deduced from angular 
momentum conservation. Initially, L = pu and, at closest approach, L = (p/2)V. It 


follows that V = 2u. @ 


7.3. THE PATH EQUATION 


In principle, the method of the last section allows us to determine the complete 


motion of the orbiting body as a function of the time. However, the procedure is usually 


too difficult to be carried through analytically. We can make the problem easier (and 


make more progress) by seeking just the equation of the path taken by the body, and not 


enquiring where the body is on this path at any particular time. 


We start from the Newton equation (7.3) and try to eliminate the time by using the 


angular momentum equation (7.5). In doing this it is helpful to introduce the new depen- 


dent variable u, given by 


u=I1/r. 


(7.12) 


This transformation has a magically simplifying effect. We begin by transforming 7 and 


7. By the chain rule, 


. d {1 1 du da 2,\ du 
r= a x x a (- é) — 
dt \u uz dd~—s alt dé 


which, on using the angular momentum equation (7.5), gives 


du 
do 


r= 


(7.13) 
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A second differentiation with respect to ¢ then gives 


d (d d2u dé a 
Pook at ay panied fe aay (7.14) 
dt \do doz” dt do? 


on using the angular momentum equation again. 
The term r6? = L7u? so that the Newton equation (7.3) is transformed into 


d*u 
Le — L*u? = f (1/u), 


that is, 


The path equation 
Pe f(l/u) (7.15) 


do2 me oe 


This is the path equation. Its solutions are the polar equations of the paths that the body 
can take when it moves under the force field F = mf (r)r. 

Despite the appearance of the left side of equation (7.15), the path equation is not 
linear in general. This is because the right side is a function of u, the dependent variable. 
Only for the inverse square and inverse cube laws does the path equation become linear. 
It is a remarkable piece of good luck that the inverse square law (the most important case 
by far) is one of only two cases that can be solved easily. 


Initial conditions for the path equation 


Suitable initial conditions for the path equation are provided by specifying the values of u 
and du/d@ when 6 = a, say. Since u = 1/r, the initial value of u is given directly by the 
initial data. The value of du/d@ is not given directly but can be deduced from equation 
(7.13) in the form 


du _ r (7.16) 
doi’ : 


where / and L are obtained from the initial data. 


Example 7.2 Path equation for the inverse cube law 


The engines of the starship Enterprise have failed and the ship is moving in a straight 
line with speed V. The crew calculate that their present course will miss the planet 
B-—Zar by a distance p. However, B — Zar is known to exert the force 
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FIGURE 7.7 The path of the Enterprise around the planet B—Zar (B). 


on any mass ™ in its vicinity. A measurement of the constant y reveals that 


8p2V2 
9 


Y= 


Show that the crew of the Enterprise will get a free tour around B —Zar before contin- 
uing along their original path. What is the distance of closest approach and what is 
the speed of the Enterprise at that instant? 


Solution 


For the given law of force, f(r) = —y/r? so that f(1/u) = —yu?. Also, from the 
initial conditions, L = pV. The path equation is therefore 


du yur 
tus : 
de2 p2V2u2 
which simplifies to 
du i ui 0 
de> 9 


on using the stated value of y. The general solution of this equation is 
u = Acos(0/3) + B sin(@/3). 


The constants A and B can now be determined from the initial conditions. Take the 
initial line @ = 0 as shown in Figure 7.7. Then: 


(i) The initial condition 7 = co when 6 = 0 implies that u = 0 when 6 = 0. It 
follows that A = 0. 


(11) The initial condition on du/d@ is given by (7.16) to be 


dus =F. 

do LL \pv/) p 
when 6 = 0. It follows that B = 3/p. 
The required solution is therefore 


u= - sin(0/3), 
Pp 
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that is 


ee en 
3 sin(@/3) 
This is the polar equation of the path of the Enterprise, as shown in Figure 7.7. The 
Enterprise recedes to infinity when sin(@/3) = O again, that is when 0 = 3z. Thus 
the Enterprise makes one circuit of B—Zar before continuing on as before. 
The distance of closest approach is p/3 and is achieved when 0 = 37/2. By 
angular momentum conservation, the speed of the Enterprise at that instant is 3V. @ 


7.4 NEARLY CIRCULAR ORBITS 


Although the path equation cannot be solved exactly for most laws of force, it is 
possible to obtain approximate solutions when the body is slightly perturbed from a known 
orbit. In particular, this can always be done when the unperturbed orbit is a circle with 
centre O. 

Suppose that a particle P moves in a circular orbit of radius a under the attractive 
force f(r) per unit mass. This is only possible if its speed v satisfies v/a = f(a), in 
which case its angular momentum L is given by L? = a? f(a). Suppose that P is now 
slightly disturbed by a small radial impulse. The angular momentum is unchanged but P 
now moves along some new path 


1 
u=—(1+é()), 
a 


where & is a small perturbation. In terms of &, the path equation becomes 
are (+8)? a 
sr l+e=+ ; 
do? : f(a) ‘i ( Lr : 


This exact equation for € is non-linear, but we will now approximate it by expanding the 


right side in powers of €. On expanding the function f (7) in a Taylor series about r = a 


= fo (8,) r@+o(4g) 
=e | aes P+e 


= f(a) —af'(ag +0 (8), 


and a simple binomial expansion gives 


(+6)? =1-2 +0 (¢?). 


On combining these results together, the constant terms cancel and we obtain 


dé afl@\, — 
wet (3+ fla) Jeno. es 
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on neglecting terms of order O (7). This is the approximate linearised equation satisfied 
by the perturbation €(@). 

The general behaviour of the solutions of equation (7.17) depends on the sign of the 
coefficient of &. 


(i) If 
af'(a) 


3 
™ f(a) 


<0, (7.18) 


then the solutions are linear combinations of real exponentials, one of which has 
a positive exponent. In this case, the solution for € will not remain small, contrary 
to assumption. The conclusion is that the original circular orbit is unstable. 


Gi) Alternatively, if 


af'(a) 
f(a) 


a > 0, (7.19) 
then the solutions are linear combinations of real cosines and sines, which remain 
bounded. The conclusion is that the original circular orbit is stable (at least to 
small radial impulses). 


Closure of the perturbed orbits 


From now on we will assume that the stability condition (7.19) is satisfied. The general 
solution of equation (7.17) then has the form 


E = Acos 026 + B sin Q6. 


We see that the perturbed orbit will close itself after one revolution if Q is a positive 
integer. When the law of force is the power law 


(cab sae 


the perturbed orbit is stable for v > —3 and will close itself after one revolution if 


where mm is a positive integer. The case m = 1 corresponds to inverse square attraction 
and m = 2 corresponds to simple harmonic attraction. The exponents v = 6, 13, ... 
are also predicted to give closed orbits. It should be remembered though that these are 
only the predictions of the approximate linearised theory.* It is possible (but not pretty) 
to improve on the linear approximation by including quadratic terms in € as well as linear 
ones. The result of this refined theory is that the powers v = —2 and v = 1 still give 


* It makes no sense to say that an orbit approximately closes itself! 
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closed orbits, but the powers v = 6, 13, ... do not. This shows that the power laws with 
v = 6, 13, ... do not give perturbed orbits that close after one revolution, but the cases 
v = —2 and v = 1 are still not finally decided. Mercifully, there is no need to carry the 


approximation procedure any further because all the paths corresponding to both inverse 
square and simple harmonic attraction can be calculated exactly. It is found that, for 
these two laws of force, all bounded orbits close after one revolution.* There remains the 
possibility that the perturbed orbits might close themselves after more than one revolution, 
but a similar analysis shows that this does not happen. We have therefore shown that the 
only power laws for which all bounded orbits are closed are the simple harmonic and 
inverse square laws. This result is actually true for all central fields (not just power laws) 
and is known as Bertrand’s theorem. 


Precession of the perihelion of Mercury 
The fact that the inverse square law leads to closed orbits, whilst very similar laws do not, 
provides an extremely sensitive test of the law of gravitation. Suppose for instance that 
the attractive force experienced by a planet were 


bc ae 
I) Fane 


(per unit mass), where y > O and |e| is small. Then the value of © for a nearly circular 
orbit is 


Q= (1-6)? =1-5€+ Oe’). 


This perturbed orbit does not close but has apsidal angle w, where 


IU IU 
a=—= =n(1+ te) + O(e?). 
Q 1-46 +0(€?) 


Hence successive perihelions of the planet will not occur at the same point, but the per- 
ihelion will advance ‘annually’ by the small angle ze. The position of the perihelion 
of a planet can be measured with great accuracy. For the planet Mercury it is found 
(after all known perturbations have been subtracted out) that the perihelion advances by 
43 (0.5) seconds of arc per century, or 5 x 10~7 radians per revolution. This correponds 
to « = 1.6 x 107’ and a power of —2.00000016 instead of —2. Miniscule though this 
discrepancy from the inverse square law seems, it is considerably greater than the error in 
the observations and for a considerable time was something of a puzzle. 

This puzzle was resolved in a striking fashion by the theory of general relativity, 
published by Einstein in 1915. Einstein showed that one consequence of his theory was 
that planetary orbits should precess slightly and that, in the case of Mercury, the rate of 
precession should be 43 seconds of arc per century! 


* Tn the inverse square case, the bounded orbits are ellipses with a focus at O, and, in the simple harmonic 
case, they are ellipses with the centre at O. 
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7.5 THE ATTRACTIVE INVERSE SQUARE FIELD 


Because of its many applications to astronomy, the attractive inverse square field 
is the most important force field in the theory of orbits. The same field occurs in particle 
scattering when the two particles carry unlike electric charges. Because of these important 
applications, we will treat the inverse square field in more detail than other fields. In par- 
ticular, we will obtain formulae that enable inverse square problems to be solved quickly 
and easily without referring to the equations of motion at all. 


The paths 
Suppose that f(r) = —y/r? where y > 0. Then f(1/u) = —yu? and the path equation 
becomes 

du y 

dee = 7 


where L is the angular momentum of the orbit. This has the form of the SHM equation 
with a constant on the right. The general solution is 


u = Acos@ + B sind + 45, 


which can be written in the form 


: = Bll + ecos(é — @)), (7.20) 
where e, w are constants with e > 0. This is the polar equation of a conic of eccentricity 
e and with one focus at O; @ is the angle between the major axis of the conic and the initial 
line 6 = 0. If e < 1, then the conic is an ellipse; if e = 1 then the conic is a parabola; 
and when e > 1 the conic is the near branch of a hyperbola. The neccessary geometry 
of the ellipse and hyperbola is summarised in Appendix A at the end of the chapter; the 
special case of the parabolic orbit is of marginal interest and we will make little mention 
of it. 


Kepler’s first law 


It follows from the above that the only bounded orbits in the attractive inverse square field 
are ellipses with one focus at the centre of force. This is Kepler’s first law, which is 
therefore a consequence of inverse square law attraction by the Sun. It would not be true 
for other laws of force. 


The L-formula and the E-formula 


By comparing the path formula (7.20) with the standard polar forms given in Appendix 
A, we see that the angular momentum L of the orbit is related to the conic parameters a, 
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b by the formula 


that is, 


The L-formula 
(7.21) 


= yb’ /a 


We will call this result the L-formula. It applies to both elliptic and hyperbolic orbits. 
It is the first of two important formulae that relate L, E, the dynamical constants of the 
motion, to the conic parameters of the resulting orbit. 

The second such formula involves the energy E. At the point of closest approach 
r=c, 


where V is the speed of P whenr = c. Since P is moving transversely at the point of 
closest approach, it follows that cV = L, so that E may be written 


on using the L-formula. 

From this point on, the different types of conic must be treated separately. When the 
orbit is an ellipse, c = a(1 — e), where e is the eccentricity, and a, b and e are related by 
the formula 


b2 
2 
eS = a 
Then £ can be written 
Pee eee, y 
2a3(1—e)* a(l—e) 
Sere 
2a 


Thus the total energy E in the orbit is directly connected to a, the semi-major axis of the 
elliptical orbit. The parabolic and hyperbolic orbits are treated similarly and the full result, 
which we will call the E-formula, is 
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FIGURE 7.8 The asteroid A moves on a hyperbolic orbit 
around the Sun S as a focus and is deflected through the 


angle p. 
The E-formula 
: 4 
Ellipse: E <0 E=-— 
2a 
(7.22) 

Parabola: E=0 
4 
Hyperbola: E>0 E= sere 
a 


Note that the type of orbit is determined solely by the sign of the total energy FE. It 
follows that that the escape condition (the condition that the body should eventually go 
off to infinity) is simply that E > 0. This useful result is true only for the inverse square 
law. 


Example 7.3 Asteroid deflected by the Sun 


An asteroid approaches the Sun with speed V along a line whose perpendicular dis- 
tance from the Sun is p. Find the angle through which the asteroid is deflected by the 
Sun. 


Solution 

In this case we have the attractive inverse square field with y = MoG, where Me 

is the mass of the Sun. This problem can be solved from first principles by using the 

path equation, but here we make short work of it by using the L- and E-formulae. 
From the initial conditions, L = pV and E = V2. Since E > 0, the orbit is the 

near branch of a hyperbola and the L- and E-formulae give 


MoGb2 MoG 
=o and tye at ay 
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It follows that 


MoG 
The semi-angle @ between the asymptotes of the hyperbola is then given (see 
Appendix A) by 
2 
tana =—-= By ‘ 
a MoG 
Let 6 be the angle through which the asteroid is deflected. Then (see Figure 7.8) 
B =a — 2a and 
MoG 
tan(B/2) = tan(z/2 —a@) = cota = al 
oy 


Period of the elliptic orbit 


Whatever the law of force, once the path of P has been found, the progress of P along 
that path can be deduced from the angular momentum equation 


ro=lL. 


If we take 9 = 0 when ¢ = 0, then the time ¢ taken for P to progress to the point of the 
orbit with polar coordintes 7, 0 is given by 


1 0 
t= al rd, (7.23) 
0 


where r = r(@) is the equation of the path. In particular then, the period t of the elliptic 
orbit is given by 


where the pathr = r(@) is given by 
== eeesd) (7.24) 
= cos @). : 
rb ss 


Fortunately there is no need to evaluate the above integral since, for any path that closes 


20 
sf r°d0=A, 
0 


where A is the area enclosed by the path. For the elliptical path, A = zab so that 


itself after one circuit, 


21ab 
t= : 
L 
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and on using the L-formula, the period of the elliptic orbit is given by: 


The period formula 


“) v (7.25) 


t=2n | — 


Kepler’s third law 


In the case of the planetary orbits, y = MoG, where Mo is the mass of the Sun. Equation 
(7.25) can then be written 


oe (ale (7.26) 

T= a. ‘ 
MoG 

This is Kepler’s third law, which is therefore a consequence of inverse square law attrac- 

tion by the Sun and would not be true for other laws of force. 


Masses of celestial bodies 


Once the constant of gravitation G is known, the formula (7.26) provides an accurate 
way to find the mass of the Sun. The same method applies to any celestial body that has 
a Satellite. All that is needed is to measure the major axis 2a and the period t of the 
satellite’s orbit.* 


Question Finding the mass of Jupiter 


The Moon moves in a nearly circular orbit of radius 384,000 km and period 27.32 
days. Callisto, the fourth moon of the planet Jupiter, moves in a nearly circular orbit 
of radius 1,883,000 km and period 16.69 days. Estimate the mass of Jupiter as a 
multiple of the mass of the Earth. 


Answer 
M, =316M._. 


Astronomical units 


For astronomical problems, it is useful to write the period equation (7.26) in astronomical 
units. In these units, the unit of mass is the mass of the Sun (Mo), the unit of length (the 
AU) is the semi-major axis of the Earth’s orbit, and the unit of time is the (Earth) year. On 


* Tt should be noted that here we are neglecting the motion of the centre of force. We will see later that, 
when this is taken into account, formula (7.26) actually gives the sum of the masses of the body and 
its satellite. Usually, the satellite has a much smaller mass than the body and its contribution can be 
disregarded. 
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FIGURE 7.9 The eccentric angle wy corresponding to the polar 
angle 0. 


substituting the data for the Earth and Sun into equation (7.26), we find that G = 4zr? in 
astronomical units. Hence, in astronomical units the period formula becomes 


Question The major axis of the orbit of Pluto 


The period of Pluto is 248 years. What is the semi-major axis of its orbit? 


Answer 
39.5 AU. & 


Time dependence of the motion — Kepler’s equation 


The formula (7.23) can be used to find how long it takes for P to progress to a general point 
of the orbit. However, although the integration with respect to 6 can be done in closed 
form, it is a very complicated expression. In order to obtain a manageable formula, we 
make a cunning change of variable, replacing the polar angle 6 by the eccentric angle wy. 
The relationship between these two angles is shown in Figure 7.9. Since CN =CF+FN, 
it follows that 


acosy = ae+rcos6, 


and, on using the polar equation for the ellipse (7.24) together with the formula b? = 
a?(1 — e”), the relation between y and 6 can be written in the symmetrical form 

b2 
(1 —ecosw)(1 + ecos@) = 5 (7.27) 
a 


Implicit differentiation of equation (7.27) with respect to y then gives 


do b 
dw  a(l—ecoswp)’ 


(7.28) 
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after more manipulation. 
We can now make the change of variable from 6 to w. From (7.23) and (7.24) 


bt [ do 
— 
a*L Jo (1+ecosé)2 


_ oe f? 1 a0 | 
=r |, (1 +ecos6)2 (=) ¥ 


v 
-?f (1 —ecosw)dvy, 
L Jo 


__ ab . 
ae i dae 


on using (7.27), (7.28). Finally, on making use of the L-formula L? = yb /a, we obtain 


Kepler’s equation 

z (7.29) 

t= — (WwW -esiny) 
20 


where t (given by (7.25)) is the period of the orbit. This is Kepler’s equation which gives 
the time as a function of position on the elliptical orbit. 

If one needs to calculate the position of the orbiting body after a given time, then 
equation (7.29) must be solved numerically for the eccentric angle y. The corresponding 
value of @ is then given by equation (7.27) and the r value by equation (7.24) which, in 
view of (7.27), can be written in the form 


r=a(l—-ecoswp). (7.30) 


The need to solve Kepler’s equation for the unknown w was a major stimulus in the 
development of approximate numerical methods for finding roots of equations. 


Example 7.4 Kepler’s equation 


A body moving in an inverse square attractive field traverses an elliptical orbit with 
eccentricity e and period t. Find the time taken for the body to traverse the half of 
the orbit that is nearer the centre of force. 


Solution 


The half of the orbit nearer the centre of force corresponds to the range —7/2 < py < 
z/2. The time taken is therefore 


T (0 42 
ae ea 
mw \2 2 #2 
For example, Halley’s comet moves on an elliptic orbit whose eccentricity is almost 


unity. It therefore spends only about 18% of its time on the half of its orbit that is 
nearer the Sun. 
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FIGURE 7.10 Two planets move on the circular 
orbits A and B. A spacecraft is required to 
depart from one planet and rendezvous with the 
other planet at some point of its orbit. The 
Hohmann orbit 7 achieves this with the least 
expenditure of fuel. 


7.6 SPACE TRAVEL —- HOHMANN TRANSFER ORBITS 


An important problem in space travel, and one that nicely illustrates the preceding 
theory, is that of transferring a spacecraft from one planet to another (from Earth to Jupiter 
say). In order to simplify the analysis, we will assume that both of the planetary orbits are 
circular. We will also suppose that the spacecraft has already effectively been removed 
from Earth’s gravity, but is still in the vicinity of the Earth and is orbiting the Sun on the 
same orbit as the Earth. The object is to use the rocket motors to transfer the spacecraft 
to the vicinity of Jupiter, orbiting the Sun on the same orbit as Jupiter. Like everything 
else on board a spacecraft, fuel has to be transported from Earth at huge cost, so the 
transfer from Earth to Jupiter must be achieved using the east mass of fuel. In our analysis 
we will neglect the time during which the rocket engines are firing so that the engines 
are regarded as delivering an impulse to the spacecraft, resulting in a sudden change of 
velocity. After the initial firing impulse, the spacecraft is assumed to move freely under 
the Sun’s gravitation until it reaches the orbit of Jupiter, when a second firing impulse is 
required to circularise the orbit. This is called a two-impulse transfer. 

If the two firings produce velocity changes of Av“ and Av® respectively, then the 
quantity Q that must be minimised if the least fuel is to be used is 


O = |Av4| + |Av? |. 


The orbit that connects the two planetary orbits and minimises Q is called the Hohmann 
transfer orbit* and is shown in Figure 7.10. It has its perihelion at the lift-off point L and 
its aphelion at the rendezvous point R. It is not at all obvious that this is the optimal orbit; 
a proof is given in Appendix B at the end of the chapter. However, it is quite easy to find 
its properties. 

Since the perihelial and aphelial distances in the Hohmann orbit are A and B (the radii 
of the orbits of Earth and Jupiter), it follows that 


A=a(1-e), B=a(1+e), 


* After Walter Hohmann, the German space research pioneer. 
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so that the geometrical parameters of the orbit are given by 


1 (B+ A) a 
a=5 ; e= ——.. 
2 BoA 


The angular momentum L of the orbit is then given by the L-formula to be 


be BA 
=P =y(1-e)a=2—, 
a BA 


where y = MoG. 
From L we can find the speed V“ of the spacecraft just after the lift-off firing, and 
the speed V * at the rendezvous point just before the second firing. These are 


gt (2B \ ge f 2rd)” 
A(B+A)) ’ B(IB+A)) — 


The travel time 7, which is half the period of the Hohmann orbit, is given by 


ie na? _ n? (B+ A)* 
Y 8y 

Finally, in order to rendezvous with Jupiter, the lift-off must take place when Earth 

and Jupiter have the correct relative positions, so that Jupiter arrives at the meeting point 


1/2 


at the right time. Since the speed of Jupiter is (y/B)°/~ and the travel time is now known, 


the angle w in Figure 7.10 must be 


Numerical results for the Earth—Jupiter transfer 


In astronomical units, G = An?, A = 1 AU and, for Jupiter, B = 5.2 AU. A speed of 1 AU 
per year is 4.74 km per second. Simple calculations then give: 


(i) The travel time is 2.73 years, or 997 days. 
(ii) V“ is 8.14 AU per year, which is 38.6 km per second. This is the speed the spacecraft 
must have after the lift-off firing. 
(iii) V* is 1.56 AU per year, which is 7.4 km per second. This is the speed with which the 
spacecraft arrives at Jupiter before the second firing. 
(iv) The angle y at lift-off must be 83°. 


The speeds V“ and V¥ should be compared with the speeds of Earth and Jupiter in their 
orbits. These are 29.8 km/sec and 13.1 km/sec respectively. Thus the first firing must boost the 
speed of the spacecraft from 29.8 to 38.6 km/sec, and the second firing must boost the speed 
from 7.4 to 13.1 km/sec. The sum of these speed increments, 14.5 km/sec, is greater than 
the speed increment needed (12.4 km/sec) to escape from the Earth’s orbit to infinity. Thus 
it takes more fuel to transfer a spacecraft from Earth’s orbit to Jupiter’s orbit than it does to 
escape from the solar system altogether! 
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7.7. THE REPULSIVE INVERSE SQUARE FIELD 


The force field with f(r) = +y/r, (y > 0), is the repulsive inverse square 
field. It occurs in the interaction of charged particles carrying like charges and is required 
for the analysis of Rutherford scattering. Below we summarise the important properties 
of orbits in a repulsive inverse square field. These results are obtained in exactly the same 
way as for the attractive case. 


The paths 
The path equation is 
d*u 
eee, 
where L is the angular momentum of the orbit. Its general solution can be written in the 
form 
ee ee Crs 
ee oe ; 


where e, @ are constants with e > 0. By comparing this path with the standard polar 
forms of conics given in Appendix A, we see that the path can only be the far branch of a 
hyperbola with focus at the centre O. 

The L- and E-formulae 

The formulae relating L, E, the dynamical constants of the orbit, to the hyperbola param- 


eters are 


L? = yb*/a, (7.31) 


E=+y/2a. (7.32) 


7.8 RUTHERFORD SCATTERING 


The most celebrated application of orbits in a repulsive inverse square field is 
Rutherford’s* famous experiment in which a beam of alpha particles was scattered by 
gold nuclei in a sheet of gold leaf. We will analyse Rutherford’s experiment in detail, 
beginning with the basic problem of a single alpha particle being deflected by a single 
fixed gold nucleus. 


Alpha particle deflected by a heavy nucleus 


An alpha particle A of mass m and charge g approaches a gold nucleus B of charge Q 
(see Figure 7.11). B is initially at rest and A is moving with speed V along a line whose 


* Emest Rutherford (1871-1937), a New Zealander, was one of the greatest physicists of the twentieth 
century. His landmark work on the structure of the nucleus in 1911 (and with Geiger and Marsden in 
1913) was conducted at the University of Manchester, England. 
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FIGURE 7.11 The alpha particle A of mass m and charge 
q is repelled by the fixed nucleus B of charge Q and 
moves on a hyperbolic orbit with the nucleus at the far 
focus. The alpha particle is deflected through the angle @. 


perpendicular distance from B is p. In the present treatment, we neglect the motion of the 
gold nucleus. This is justified since the mass of the gold nucleus is about fifty times larger 
than that of the alpha particle. A then moves in the electrostatic field due to B, which we 
now suppose to be fixed at the origin O. The force exerted on A is then 


Pod oy 
‘ 


in cgs units. This is the repulsive inverse square field with y = gQ/m. 
We wish to find 6, the angle through which the alpha particle is deflected. This is 
obtained in exactly the same way as that of the asteroid in Example 7.1. From the initial 


conditions, L = pV and E = 5V?. The L-formula (7.31) and the E-formula (7.32) then 
give 


2 
2172 _ YO 172 Y 
V-=—_, zV° =4+—. 
P a 2 2a 
It follows that 
Y 
Oe 7a? b=p. 


The semi-angle a between the asymptotes of the hyperbola is then given (see Appendix 

A) by 

pV? 

tang = — = ——. 
a Y 


Hence, 6, the angle through which the asteroid is deflected, is given by 


Y 


tan(0/2) = tan(7r/2 — a) = cota = —;. 
pv? 
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FIGURE 7.12 General scattering. A typical particle crosses the reference plane at the point p and 
finally emerges in the direction of the unit vector n. Particles that cross the reference plane within the 
region A emerge within the (generalised) cone shown. 


On writing y = gQ/m, we obtain 


qQ 
mpV2— 


tan(@/2) = (7.33) 
as the formula for the deflection angle of the alpha particle. The quantity p, the distance 
by which the incident particle would miss the scatterer if there were no interaction, is 
called the impact parameter of the particle. 

The deflection formula (7.33) cannot be confirmed directly by experiment since this 
would require the observation of a single alpha particle, a single nucleus, and a knowledge 
of the impact parameter p. What is actually done is to irradiate a gold target by a uniform 
beam of alpha particles of the same energy. Thus the target consists of many gold nuclei 
together with their associated electrons. However, the electrons have masses that are very 
small compared to that of an alpha particle and so their influence can be disregarded. 
Also, the gold target is taken in the form of thin foil to minimise the chance of multiple 
collisions. If multiple collisions are eliminated, then the gold nuclei act as independent 
scatterers and the problem reduces to that of a single fixed gold nucleus irradiated by a 
uniform beam of alpha particles. In this problem the alpha particles come in with different 
values of the impact parameter p and are scattered through different angles in accordance 
with formula (7.33). What can be measured is the angular distribution of the scattered 
alpha particles. 


Differential scattering cross-section 


The angular distribution of scattered particles is expressed by a function o(), called the 
differential scattering cross section, where the unit vector n specifies the final direction 
of emergence of a particle from the scatterer O. One may imagine the values of n corre- 
sponding to points on the surface of a sphere with centre O and unit radius, as shown in 
Figure 7.12. Then values of n that lie in the shaded patch S correspond to particles whose 
final direction of emergence lies inside the (generalised) cone shown. 


182 Chapter 7 Orbits in a central field 


angle 4, 


radius p, 


FIGURE 7.13 Axisymmetric scattering. Particles crossing the reference plane 
within the shaded circular disk are scattered and emerge in directions within the 
circular cone. 


Take a reference plane far to the left of the scatterer and perpendicular to the incident 
beam, as shown in Figure 7.12. Suppose that there is a uniform flux of incomimg parti- 
cles crossing the reference plane such that N particles cross any unit area of the reference 
plane in unit time. When these particles have been scattered, they will emerge in different 
directions and some of the particles will emerge with directions lying within the (gener- 
alised) cone shown in Figure 7.12. The differential scattering cross section is defined 
to be that function o(n) such that the flux of particles that emerge with directions lying 
within the cone is given by the surface integral 


nf a(n)dS. (7.34) 
S 


It is helpful to regard o (m) as a scattering density, analogous to a probability density, that 
must be integrated to give the flux of particles scattered within any given solid angle. 

The particles that finally emerge within the cone must have crossed the reference plane 
within some region A as shown in Figure 7.12. A typical particle crosses the reference 
plane at the point p (relative to O’) and eventually emerges in the direction n lying within 
the cone. However because the incoming beam is uniform, the flux of these particles 
across A is just N|.A|, where |.A| is the area of the region A. On equating the incoming 
and outgoing fluxes, we obtain the relation 


i a(n)dS = |A|. (7.35) 
S 


This is the general relation that any differential scattering cross section must satisfy; it 
simply expresses the equality of incoming and outgoing fluxes of particles. However, 
Rutherford scattering is axisymmetric and this provides a major simplification. 


Axisymmetric scattering and Rutherford’s formula 


Rutherford scattering is simpler than the general case outlined above in that the problem 
is axisymmetric about the axis O’O. Thus o depends on 6 (the angle between n and the 
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axis O’O), but is independent of @ (the azimuthal angle measured around the axis). In 
this case o (@) can be determined by using the axisymmetric regions shown in Figure 7.13. 
Particles that cross the reference plane within the circle centre O' and radius p; emerge 
within the circular cone 0; < 6 < 2, where p and 0, are related by the deflection formula 
for a single particle, in our case formula (7.33). On applying equation (7.35) to the present 
case, we obtain 


/ o(0)dS = xp?. 
S 


We evaluate the surface integral using 6, ¢ coordinates. The element of surface area on 
the unit sphere is given by dS = sin@ d@d¢ so that 


sa 20 
[owas -| {1 cosines dé 
S (cal 0 


1s 
= an f o(@)siné dé. 
6 


1 


Hence 


1s 
anf o(0)sin@ dé = xp} 
6 


1 
PI 
=2 f pdp 
0 


1s 
d 
= -2n | pao, 
6, 0 


on changing the integration variable from p to 6. Here the impact parameter p is regarded 
as a function of the scattering angle 6. Now the above equality holds for all choices of the 
integration limit 6; and this can only be true if the two integrands are equal. Hence: 


Axisymmetric scattering cross section 


0) = (5) 36 


(7.36) 


This is the formula for the differential scattering cross section o in any problem of axisym- 
metric scattering. All that is needed to evaluate it in any particular case is the expression 
for the impact parameter p in terms of the scattering angle 0. 

In the case of Rutherford scattering, the expression for p in terms of 0 is provided 
by solving equation (7.33) for p, which gives 


qQ 


mvV2 


p= tan(@/2). 
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On substituting this function into the formula (7.36), we obtain 


Rutherford’s scattering cross-section 


o(0) = 


16E* \sin*(9/2) 


where FE (= 5mV) is the energy of the incident alpha particles. This is Rutherford’s 
formula for the angular distribution of the scattered alpha particles. 


Significance of Rutherford’s experiment 


In the above description we have used the term ‘nucleus’ for convenience. What we 
really mean is ‘the positively charged part of the atom that carries most of the mass’. 
If this positive charge is distributed in a spherically symmetric manner, then the above 
results still hold, irrespective of the radius of the charge, provided that the alpha particles 
do not penetrate into the charge itself. What Rutherford found was that, when using 
alpha particles from a radium source, the formula (7.37) held even for particles that were 
scattered through angles close to z. These are the particles that get closest to the nucleus, 
the distance of closest approach being gQ/E. This meant that the nuclear radius of gold 
must be smaller than this distance, which was about 10~!* cm in Rutherford’s experiment. 
The radius of an atom of gold is about 10~* cm. This result completely contradicted the 
Thompson model, in which the positive charge was distributed over the whole volume of 
the atom, by showing that the nucleus (as it became known) must be a very small and very 
dense core at the centre of the atom. 


Note on two-body scattering problems 


Throughout this section we have neglected the motion of the target nucleus. This will introduce 
only small errors when the target nucleus is much heavier than the incident particles, as it was 
in Rutherford’s experiment. However, if lighter nuclei are used as the target, then the motion of 
the nucleus cannot be neglected and we have a two-body scattering problem. Such problems 
are treated in Chapter 10. 


Appendix A The geometry of conics 


Ellipse 


(i) In Cartesian coordinates, the standard ellipse with semi-major axis a and semi-minor axis b (b < a) 
has the equation 
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(0, b) 


FIGURE 7.14 The standard ellipse x*/a* + y?/b? = 1. 


near branch far branch 


FIGURE 7.15 The standard hyperbola x7/a? — y*/b* = 1. The 
near and far branches are relative to the focus F’, which is the 
origin of polar coordinates. 


(ii) The eccentricity e of the ellipse is defined by 


and lies in the range 0 < e < 1. When e = 0, b = a and the ellipse is a circle. 

(iii) The focal points F’, F’ of the ellipse lie on the major axis at (tae, 0). 

(iv) In polar coordinates with origin at the focus F and with initial line in the positive x-direction, the 
equation of the ellipse is 


De ee eoat 
pe é€ COS 7 
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B 


FIGURE 7.16 The circular orbits A and B are the orbits of the two planets. The 
elliptical orbit shown is a possible path for the spacecraft, which travels along 
the arc LR. The velocities shown are those after the first firing at L and before 
the second firing at R. 


Hyperbola 

(i) In Cartesian coordinates, the standard hyperbola has the equation 
eee b>0 
a - a (a,b > 0) 


so that the angle 2~ between the asymptotes is given by 
b 
tana = -. 
a 


(ii) The eccentricity e of the hyperbola is defined by 


and lies in the range e > 1. 
(iii) The focal points F, F’ of the hyperbola lie on the x-axis at (ae, 0). 


(iv) In polar coordinates with origin at the focus F’ and with initial line in the positive x-direction, the 


equations of the near and far branches of the hyperbola are 
= ieee, 22 daosesy 
-=— cos 0), -=—(- cos 8), 
r pe 7 rp? i 


respectively. 


Appendix B The Hohmann orbit is optimal 


The result that the Hohmann orbit is the connecting orbit that minimises Q is not at all 
obvious and correct proofs are rare.* Hopefully, the proof given below is correct! 


* Tt is sometimes stated that the optimality requirement is to minimise the energy of the connecting orbit, 
which is not true. In any case, the Hohmann orbit is not the connecting orbit of minimum energy! 


Appendix B_ The Hohmann orbit is optimal 187 


Proof of optimality Consider the general two-impulse transfer orbit LR shown in Figure 7.16, where 
the orbit is regarded as being generated by the velocity components vey uA of the spacecraft after the first 
impulse. Then, by angular momentum and energy conservation, 


A _ B 
Avg = Buy, 


2 2 Y 2 2 2 
(7) +08) - Zar) +(e) ~ a 


where A, B are the radii of the circular orbits of Earth and Jupiter and y = MoG. Thus 


(= (1-2) ()"+ (A) 2-3) 


Since the orbital speeds of Earth and Jupiter are (y / A)!/2 and (y/B) 1/2 it follows that the velocity changes 
Av4, Av? required at L and R have magnitudes given by 


|Av4/2 = (vd 7 (e)")'4 (oA), 


2 
2A 2 ; 2 - 
= (St) +620 ar(3-3-8 
(¥ a) + (Uv, +r(5 A Bat: 
It is evident that, with ue fixed, both |Av4| and |Av? | are increasing functions of oe, Thus Q may be 


reduced by reducing uf provided that the resulting orbit still meets the circle = B. Q can be thus reduced 
until either 


(i) uA is reduced to zero, or 
(ii) the orbit shrinks until it touches the circle r = B and any further reduction in vA would mean that 
the orbit would not meet r = B. 


In the first case, L becomes the perihelion of the orbit and, in the second case, R becomes the aphelion of 
the orbit. We will proceed assuming the first case, the second case being treated in a similar manner and 
with the same result. 

Suppose then that L is the perihelion of the connecting orbit. Then uf = 0 and, from now on, we 
will simply write v instead of De The velocity v must be such that the orbit reaches the circler = B, 
which now means that the major axis of the orbit must not be less than A + B. On using the E-formula, this 
implies that v must satisfy 


Be a. 
— A(A + B) 


The formulae for |Av4| and |Av? | now simplify to 


188 Chapter 7 Orbits in a central field 


from which it is evident that, for v in the permitted range, both of |AvA| and |Av8 | are increasing functions 
v. Hence the minimum value of Q is achieved when v takes its smallest permitted value, namely 


oyB \¥2 
= (Gan) 


With this value of v, the orbit touches the circle r = B and so has its aphelion at R. Hence the optimum 
orbit has its perihelion at L and its aphelion at R. This is precisely the Hohmann orbit. 


Problems on Chapter 7 


Answers and comments are at the end of the book. 


Harder problems carry a star (*). 


Radial motion equation, apses 


7.1 A particle P of mass m moves under the repulsive inverse cube field F = (my/r?)7. 
Initially P is at a great distance from O and is moving with speed V towards O along a 
straight line whose perpendicular distance from O is p. Find the equation satisfied by the 
apsidal distances. What is the distance of closest approach of P to O? 


7.2 A particle P of mass m moves under the attractive inverse square field F = —(my/r2)7. 
Initially P is at a point C, a distance c from O, when it is projected with speed (y/c)!/2 
direction making an acute angle a with the line OC. Find the apsidal distances in the resulting 
orbit. 

Given that the orbit is an ellipse with O at a focus, find the semi-major and semi-minor 
axes of this ellipse. 


ina 


7.3 A particle of mass m moves under the attractive inverse square field F = —(my/r?)?. 
Show that the equation satisfied by the apsidal distances is 


2h yk 2yr — L?= 0; 


where E and L are the specific total energy and angular momentum of the particle. When 
E < 0, the orbit is known to be an ellipse with O as a focus. By considering the sum and 
product of the roots of the above equation, establish the elliptic orbit formulae 


Le = yb? /a, E=-—-y/2a. 


7.4 A particle P of mass m moves under the simple harmonic field F = —(mQ?r)7, where 
Q is a positive constant. Obtain the radial motion equation and show that all orbits of P are 
bounded. 

Initially P is at a point C, a distance c from O, when it is projected with speed Qc in 
a direction making an acute angle a with OC. Find the equation satisfied by the apsidal 
distances. Given that the orbit of P is an ellipse with centre O, find the semi-major and semi- 
minor axes of this ellipse. 
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Path equation 


7.5 A particle P moves under the attractive inverse square field F = —(my/r7)F. Initially 
P is at the point C, a distance c from O, and is projected with speed (3 /c)!/? perpendicular 
to OC. Find the polar equation of the path make a sketch of it. Deduce the angle between OC 
and the final direction of departure of P. 


7.6 A comet moves under the gravitational attraction of the Sun. Initially the comet is at a 
great distance from the Sun and is moving towards it with speed V along a straight line whose 
perpendicular distance from the Sun is p. By using the path equation, find the angle through 
which the comet is deflected and the distance of closest approach. 


7.7 A particle P of mass m moves under the attractive inverse cube field F = —(my*/r>)F, 
where y is a positive constant. Initially P is at a great distance from O and is projected 
towards O with speed V along a line whose perpendicular distance from O is p. Obtain the 
path equation for P. 

For the case in which 


_ iy 
4/209 p’ 


find the polar equation of the path of P and make a sketch of it. Deduce the distance of closest 
approach to O, and the final direction of departure. 


7.8% A particle P of mass m moves under the central field F = —(my?/r>)¥, where y is 
a positive constant. Initially P is at a great distance from O and is projected towards O with 
speed /2y/p? along a line whose perpendicular distance from O is p. Show that the polar 
equation of the path of P is given by 


Make a sketch of the path. 


7.9% A particle of mass m moves under the central field 


F 2(4 + a\ 5 
=-m r, 
VN 


where y and a are positive constants. Initially the particle is at a distance a from the centre of 
force and is projected at right angles to the radius vector with speed 3y //2a. Find the polar 
equation of the resulting path and make a sketch of it. 

Find the time taken for the particle to reach the centre of force. 


Nearly circular orbits 


7.10 A particle of mass m moves under the central field 


—er/a 
F=-m (¥ yr 
r2 
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where y, a and € are positive constants. Find the apsidal angle for a nearly circular orbit of 
radius a. When € is small, show that the perihelion of the orbit advances by approximately zre 
on each revolution. 


7.11 Solar oblateness A planet of mass m moves in the equatorial plane of a star that is a 
uniform oblate spheroid. The planet experiences a force field of the form 


approximately, where y, a and € are positive constants and € is small. If the planet moves 
in a nearly circular orbit of radius a, find an approximation to the ‘annual’ advance of the 
perihelion. [It has been suggested that oblateness of the Sun might contribute significantly to 
the precession of the planets, thus undermining the success of general relativity. This point has 
yet to be resolved conclusively. ] 


7.12 Suppose the solar system is embedded in a dust cloud of uniform density p. Find an 
approximation to the ‘annual’ advance of the perihelion of a planet moving in a nearly circular 
orbit of radius a. (For convenience, let 9p = «M /a, where M is the solar mass and € is 
small.) 


7.13 Orbits in general relativity In the theory of general relativity, the path equation for a 
planet moving in the gravitational field of the Sun is, in the standard notation, 


u 


d*u MG (3MG\ 5 
+ _ u ’ 
do L2 


C2 


where c is the speed of light. Find an approximation to the ‘annual’ advance of the perihelion 
of a planet moving in a nearly circular orbit of radius a. 


Scattering 


7.14 A uniform flux of particles is incident upon a fixed hard sphere of radius a. The particles 
that strike the sphere are reflected elastically. Find the differential scattering cross section. 


7.15 A uniform flux of particles, each of mass m and speed V, is incident upon a fixed scat- 
terer that exerts the repulsive radial force F = (my*/r3)*. Find the impact parameter p as 
a function of the scattering angle 0, and deduce the differential scattering cross section. Find 
the total back-scattering cross-section. 


Assorted inverse square problems 
Some useful data: 


The radius R of the Earth is 6380 km. To obtain the value of MG, where M is the mass of the 
Earth, use the formula MG = R*g, where g = 9.80 ms~?. 


1 AU per year is 4.74 km per second. In astronomical units, G = 477. 
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7.16 In Yuri Gagarin’s first manned space flight in 1961, the perigee and apogee were 181 
km and 327 km above the Earth. Find the period of his orbit and his maximum speed in the 
orbit. 


7.17 An Earth satellite has a speed of 8.60 km per second at its perigee 200 km above the 
Earth’s surface. Find the apogee distance above the Earth, its speed at the apogee, and the 
period of its orbit. 


7.18 A spacecraft is orbiting the Earth in a circular orbit of radius c when the motors are 
fired so as to multiply the speed of the spacecraft by a factor k(k > 1), its direction of 
motion being unaffected. [You may neglect the time taken for this operation.] Find the range 
of k for which the spacecraft will escape from the Earth, and the eccentricity of the escape 
orbit. 


7.19 A spacecraft travelling with speed V approaches a planet of mass M along a straight 
line whose perpendicular distance from the centre of the planet is p. When the spacecraft 
is at a distance c from the planet, it fires its engines so as to multiply its current speed by a 
factor k(O < k < 1), its direction of motion being unaffected. [You may neglect the time 
taken for this operation.] Find the condition that the spacecraft should go into orbit around the 
planet. 


7.20 A body moving in an inverse square attractive field traverses an elliptical orbit with major 
axis 2a. Show that the time average of the potential energy V = —y/r is —y/a. [Transform 
the time integral to an integral with repect to the eccentric angle w.] 

Deduce the time average of the kinetic energy in the same orbit. 


7.21 A body moving in an inverse square attractive field traverses an elliptical orbit with 
eccentricity e and major axis 2a. Show that the time average of the distance r of the body 
from the centre of force is a(1 + xe’). [Transform the time integral to an integral with respect 
to the eccentric angle y.] 


7.22 A spacecraft is ‘parked’ in a circular orbit 200 km above the Earth’s surface. The space- 
craft is to be sent to the Moon’s orbit by Hohmann transfer. Find the velocity changes Av’ 
and Av™ that are required at the Earth and Moon respectively. How long does the jour- 
ney take? [The radius of the Moon’s orbit is 384,000 km. Neglect the gravitation of the 
Moon. ] 


7.23% A spacecraft is ‘parked’ in an elliptic orbit around the Earth. What is the most fuel 
efficient method of escaping from the Earth by using a single impulse? 


7.24 A satellite already in the Earth’s heliocentric orbit can fire its engines only once. What 
is the most fuel efficient method of sending the satellite on a ‘flyby’ visit to another planet? 
The satellite can visit either Mars or Venus. Which trip would use less fuel? Which trip 
would take the shorter time? [The orbits of Mars and Venus have radii 1.524 AU and 0.723 
AU respectively. ] 
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7.25 A satellite is ‘parked’ in a circular orbit 250 km above the Earth’s surface. What is the 
most fuel efficient method of transferring the satellite to an (elliptical) synchronous orbit by 
using a single impulse? [A synchronous orbit has a period of 23 hr 56 m.] Find the value of 
Av and apogee distance. 


Effect of resistance 


7.26 A satellite of mass m moves under the attractive inverse square field —(my/r7)F and is 
also subject to the linear resistance force —mK v, where K is a positive constant. Show that 
the governing equations of motion can be reduced to the form 


L2e72Kt 


rF+KrF+ v 5 5 = 0, r°6 = Loe *', 
r r 


where Lo is a constant which will be assumed to be positive. 

Suppose now that the effect of resistance is slight and that the satellite is executing a ‘cir- 
cular’ orbit of slowly changing radius. By neglecting the terms in7 and /’, find an approximate 
solution for the time variation of r and @ in such an orbit. Deduce that small resistance causes 
the circular orbit to contract slowly, but that the satellite speeds up! 


7.27 Repeat the last problem for the case in which the particle moves under the simple har- 
monic attractive field —(mQ?r)? with the same law of resistance. Show that, in this case, 
the body slows down as the orbit contracts. [This problem can be solved exactly in Cartesian 
coordinates, but do not do it this way.] 


Computer assisted problems 


7.28 See the advance of the perihelion of Mercury It is possible to ‘see’ the advance of 
the perihelion of Mercury predicted by general relativity by direct numerical solution. Take 
Einstein’s path equation (see Problem 7.13) in the dimensionless form 


d*v 1 2 
Te re 


where v = au. Here a and e are the semi-major axis and eccentricity of the non-relativistic 
elliptic orbit and 7 = 3MG /ac? is a small dimensionless parameter. For the orbit of Mercury, 
n = 2.3 x 107’ approximately. 

Solve this equation numerically with the initial conditions r = a(1 + e) and’ = 0 when 
@ = 0; this makes 6 = 0 an aphelion of the orbit. To make the precession easy to see, use 
a fairly eccentric ellipse and take 7 to be about 0.005, which speeds up the precession by a 
factor of more than 10*! 


7.29 Orbit with linear resistance Confirm the approximate solution for small resistance 
obtained in Problem 7.26 by numerical solution of the governing simultaneous ODEs. First 
write the governing equations in dimensionless form. Suppose that, in the absence of 


7.8 Problems 193 


resistance, a circular orbit with r = a and 6 = Qis possible; then y = a’Q and Lo = a2Q. 
On taking dimensionless variables p, t defined by p = r/a and t = Qt, and taking Lo = a7Q, 
the governing equations become 


d*p ais dp 1 gt Poa — p—2et 
dt? dt p2 p? dt , 


where € = K/Q is the dimensionless resistance parameter. Solve these equations with the 
initial conditions p = 1, do/dt = 0 and 0 = 0 when t = 0. Choose some small value for € 
and plot a polar graph of the path. 


Chapter Eight 


Non-linear oscillations 
and phase space 


KEY FEATURES 
The key features of this chapter are the use of perturbation theory to solve weakly non-linear 
problems, the notion of phase space, the Poincaré—Bendixson theorem, and limit cycles. 


In reality, most oscillating mechanical systems are governed by non-linear equations. 
The linear oscillation theory developed in Chapter 5 is generally an approximation which 
is accurate only when the amplitude of the oscillations is small. Unfortunately, non-linear 
oscillation equations do not have nice exact solutions as their linear counterparts do, and 
this makes the non-linear theory difficult to investigate analytically. 

In this chapter we describe two different analytical approaches, each of which is suc- 
cessful in its own way. The first is to use perturbation theory to find successive cor- 
rections to the linear theory. This gives a more accurate solution than the linear theory 
when the non-linear terms in the equation are small. However, because the solution is 
close to that predicted by the linear theory, new phenomena associated with non-linearity 
are unlikely to be discovered by perturbation theory! The second approach involves the 
use of geometrical arguments in phase space. This has the advantage that the non-linear 
effects can be large, but the conclusions are likely to be qualitative rather than quantita- 
tive. A particular triumph of this approach is the Poincaré—Bendixson theorem, which 
can be used to prove the existence of limit cycles, a new phenomenon that exists only in 
the non-linear theory. 


8.1. PERIODIC NON-LINEAR OSCILLATIONS 


Most oscillating mechanical systems are not exactly linear but are approximately 
linear when the oscillation amplitude is small. In the case of a body on a spring, the 
restoring force might actually have the form 


S =mQ?x +mAx?, (8.1) 


which is approximated by the linear formula S = mQ*x when the displacement x is small. 
The new constant A is a measure of the strength of the non-linear effect. If A < 0, then 
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FIGURE 8.1 Existence of periodic oscillations for the quartic 
potential energy V = 5mQ?x? + gmAx4 with A < 0. 


S is less than its linear approximation and the spring is said to be softening as x increases. 
Conversely, if A > 0, then the spring is hardening as x increases. The formula (8.1) is 
typical of non-linear restoring forces that are symmetrical about x = 0. If the restoring 
force is unsymmetrical about x = 0, the leading correction to the linear case will be a 


term in x2. 


Existence of non-linear periodic oscillations 


Consider the free undamped oscillations of a body sliding on a smooth horizontal table* 
and connected to a fixed point of the table by a spring whose restoring force is given by 
the cubic formula (8.1). In rectilinear motion, the governing equation is then 


a2 a 
<7 +97x + Ax? =0, (8.2) 


which is Duffing’s equation with no forcing term (see section 8.5). The existence of 
periodic oscillations can be proved by the energy method described in Chapter 6. The 
restoring force has potential energy 
V= 5mQ?x? + pmAx4, 
so that the energy conservation equation is 
smv- + 5mQ? x? + imAx* =F, 
where v = x. The motion is therefore restricted to those values of x that satisfy 


amQ? x? + amAx4 <&E, 


* Would the motion be the same (relative to the equilibrium position) if the body were suspended vertically 
by the same spring? 
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with equality when v = 0. Figure 8.1 shows a sketch of V for a softening spring (A < 0). 
For each value of F in the range 0 < E < Vmax, the particle oscillates in a symmetrical 
range —a < x < aas shown. Thus oscillations of any amplitude less than dmax (= 
Q/|A|!/*) are possible. For a hardening spring, oscillations of any amplitude whatsoever 
are possible. 


Solution by perturbation theory 


Suppose then that the body is performing periodic oscillations with amplitude a. In order 
to reduce the number of parameters, we non-dimensionalise equation (8.2). Let the dimen- 
sionless displacement X be defined by x = aX. Then X satisfies the equation 


aaa a a0 (8.3) 
— € = i. . 
Q? dt? 
with the initial conditions X = 1 and dX/dt = 0 when t = 0. The dimensionless 
parameter ¢, defined by 


(8.4) 


is a measure of the strength of the non-linearity. Equation (8.3) contains € as a parameter 
and hence so does the solution. A major feature of interest is how the period t of the 
motion varies with e. 

The non-linear equation of motion (8.3) cannot be solved explicitly but it reduces 
to a simple linear equation when the parameter € is zero. In these circumstances, one 
can often find an approximate solution to the non-linear equation valid when € is small. 
Equations in which the non-linear terms are small are said to be weakly non-linear and 
the solution technique is called perturbation theory. There is a well established theory 
of such perturbations. The simplest case is as follows: 


Regular perturbation expansion 


If the parameter € appears as the coefficient of any term of an ODE that is not the 
highest derivative in that equation, then, when € is small, the solution corresponding 
to fixed initial conditions can be expanded as a power series in €. 


This is called a regular perturbation expansion* and it applies to the equation (8.3). 
It follows that the solution X (t, €) can be expanded in the regular perturbation series 


X(t, ©) = Xo(t) +eX1(t) +e2X2(t) + (8.5) 


* The case in which the small parameter multiplies the highest derivative in the equation is called a singular 
perturbation. For experts only! 
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The standard method is to substitute this series into the equation (8.3) and then to try to 
determine the functions Xo0(ft), X1(¢), X2(¢), ... . In the present case however, this leads 
to an unsatisfactory result because the functions X(t), X2(t), ... , turn out to be non- 
periodic (and unbounded) even though the exact solution X (t, €) is periodic!* Also, it is 
not clear how to find approximations to t from such a series. 

This difficulty can be overcome by replacing ¢ by a new variable s so that the solution 
X (s,€) has period 27 in s whatever the value of €. Every term of the perturbation series 
will then also be periodic with period 27. This trick is known as Lindstedt’s method. 


Lindstedt’s method 


Let w(€) (= 27/T(€)) be the angular frequency of the required solution of equation (8.3). 
Now introduce a new independent variable s (the dimensionless time) by the equation 
s = a(e)t. Then X (s, €) satisfies the equation 


2 
(2) X"4X+4exX3=0 (8.6) 


with the initial conditions X = 1 and X' = 0 when s = O. (Here ’ means d/ds.) We now 
seek a solution of this equation in the form of the perturbation series 


X (Ss, €) = Xo(s) + €X1(s) + e7Xo(s) +>: : (8.7) 


which is possible when € is small. By construction, this solution must have period 27 for 
all € from which it follows that each of the functions Xo(s), X1(s), X2(s), ...must also 
have period 277. However we have paid a price for this simplification since the unknown 
angular frequency w(€) now appears in the equation (8.6); indeed, the function w(eé) is 
part of the answer to this problem! We must therefore also expand w(e) as a perturbation 
series in €. From equation (8.3), it follows that w(0) = Q so we may write 


w(e 
OO Lit metontt, (8.8) 
where 1, @2,... are unknown constants that must be determined along with the functions 


Xo(s), X1(s), X2(s),...« 
On substituting the expansions (8.7) and (8.8) into the governing equation (8.6) and 
its initial conditions, we obtain: 


(1+ are tage? +++ (XG EXT FOX H+) + 


(Xo teX; +X. +--+) te(Xo teX; FEXI++-- YP =O, 


* This ‘paradox’ causes great bafflement when first encountered, but it is inevitable when the period t of 
the motion depends on €, as it does in this case. To have a series of non-periodic terms is not wrong, 
as is sometimes stated. However, it is certainly unsatisfactory to have a non-periodic approximation to a 
periodic function. 
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with 

XoteX  teX +. =1, 

Ki bex box) ee SO, 
when s = 0. If we now equate coefficients of powers of € in these equalities, we obtain a 
succession of ODEs and initial conditions, the first two of which are as follows: 


From coefficients of €°, we obtain the zero order equation 
X5+Xo0=0, (8.9) 


with Xo = 1 and Xj = 0 whens = 0. 
From coefficients of €!, we obtain the first order equation 


X{+X1 = —2a1 XG — XZ, (8.10) 


with X; = 0 and X' =O whens =0. 


This procedure can be extended to any number of terms but the equations rapidly become 
very complicated. The method now is to solve these equations in order; the only sticking 
point is how to determine the unknown constants @, @2,... that appear on the right sides 
of the equations. The solution of the zero order equation and initial conditions is 


Xo =coss (8.11) 


and this can now be substituted into the first order equation (8.10) to give 


Xi + X1 = 21 coss —cos’s 


a i (8m, —3)coss + 7 COS 35, (8.12) 


on using the trigonometric identity cos3s = 4cos*s — 3coss. This equation can now 
be solved by standard methods. The particular integral corresponding to the cos 3s on the 
right is —(1/8) cos 3s, but the particular integral corrsponding to the cos s on the right is 
(1/2)s sins, since cos s is a solution of the equation X"” + X = 0. The general solution of 
the first order equation is therefore 


X,= (or — §)ssins — 35 608 35 +Acoss+B sins, 
where A and B are arbitrary constants. Observe that the functions cos s, sins and cos 3s 
are all periodic with period 27, but the term s sins is not periodic. Thus, the coefficient 


of s sins must be zero, for otherwise X (s) would not be periodic, which we know it must 
be. Hence 


3 
=. 8.13 
w= s (8.13) 
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which determines the first unknown coefficient in the expansion (8.8) of w(€). The solu- 
tion of the first order equation and initial conditions is then 


1 
X{= aoe — cos 3s). (8.14) 
We have thus shown that, when € is small, 


0) 3 > 
a a=itzeto(e), 


and 


X =coss + 5 (cos s — cos 3s) + O (<°). 


where 5 = (1 + ze +O (?)) Qt. 


Results 


When ¢€ (= a7 / 7) is small, the period t of the oscillation of equation (8.2) with 
amplitude a is given by 


rae == (1+3e+0(e)) = 2 (1-2e+0(2)) (8.15) 


and the corresponding displacement x(t) is given by 


x=a [cos + 5 (cos s —cos3s) + O (<*)] : (8.16) 


where 5 = (1 + 36 +O (?)) Qt. 


This is the approximate solution correct to the first order in the small parameter €. 
More terms can be obtained in a similar way but the effort needed increases exponentially 
and this is best done with computer assistance (see Problem 8.15). 


These formulae apply only when e€ is small, that is, when the non- linearity in the 
equation has a small effect. Thus we have laboured through a sizeable chunk of mathe- 
matics to produce an answer that is only slightly different from the linear case. This sad 
fact is true of all regular perturbation problems. However, in non-linear mechanics, one 
must be thankful for even modest successes. 


8.2. THE PHASE PLANE ((x;, x2)—plane) 


The second approach that we will describe could not be more different from per- 
turbation theory. It makes use of qualitative geometrical arguments in the phase space of 
the system. 
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Systems of first order ODEs 


The notion of phase space springs from the theory of systems of first order ODEs. 
Such systems are very common and need have no connection with classical mechanics. A 
standard example is the predator-prey system of equations 


xX] = ax, — bx 1X2, 


X2 = bx{x2 — CX2, 


which govern the population density x; (¢) of a prey and the population density x2(¢) of its 
predator. In the general case there are n unknown functions satisfying n first order ODEs, 
but here we will only make use of two unknown functions x; (¢), x2(t) that satisfy a pair 
of first order ODEs of the form 

x1 = Fy (x1, x2, 6), 


: 8.17 
X2 = Fo(x1, x2, ¢). ED) 


Just to confuse matters, a system of ODEs like (8.17) is called a dynamical system, 
whether it has any connection with classical mechanics or not! In the predator-prey 
dynamical system, the function Fy = ax, — bx,x2 and the function Fz = bx x2 — cx2. 
In this case F, and F2 have no explicit time dependence. Such systems are said to be 
autonomous; as we shall see, more can be said about the behaviour of autonomous sys- 
tems. 


Definition 8.1 Autonomous system A system of equations of the form 


x1 = Fy(x1, x2), 


. 8.18 
X2 = F2(x1, x2), om 


is said to be autonomous. 


The phase plane 


The values of the variables x;, x2 at any instant can be represented by a point in the 
(x1, x2)-plane. This plane is called the phase plane* of the system. A solution of the 
system of equations (8.17) is then represented by a point moving in the phase plane. The 
path traced out by such a point is called a phase path’ of the system and the set of all 
phase paths is called the phase diagram. In the predator-prey problem, the variables x1, 
X2 are positive quantities and so the physically relevant phase paths lie in the first quadrant 
of the phase plane. It can be shown that they are all closed curves! (See Problem 8.10). 


Phase paths of autonomous systems 


The problem of finding the phase paths is much easier when the system is autonomous. 
The method is as follows: 


* In the general case with n unknowns, the phase space is n-dimensional. 
* Also called an orbit of the system. 
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FIGURE 8.2 Phase diagram for the system 
dx,/dt = x2 —1, dx2/dt = —x, +2. The 


point E (2, 1) is an equilibrium point of the Sia 4 xy 
system. 


Example 8.1 Finding phase paths for an autonomous system 


Sketch the phase diagram for the autonomous system of equations 


dx, 1 
—$ x= xn- 4 
dt fe 

dx2 

—_—=-Xx 2. 
a x1 + 


Solution 
The phase paths of an autonomous system can be found by eliminating the time 
derivatives. The path gradient is given by 


dx dx2/dt 
dx,  dx,/dt 
x1 —2 

7 x9 -—1 


and this is a first order separable ODE satisfied by the phase paths. The general 
solution of this equation is 


(1 — 2)? +02 —1)? =C 


and each (positive) choice for the constant of integration C corresponds to a phase 
path. The phase paths are therefore circles with centre (2, 1); the phase diagram is 
shown in Figure 8.2. 

The direction in which the phase point progresses along a path can be deduced 
by examining the signs of the right sides in equations (8.18). This gives the signs of 
xX, and x2 and hence the direction of motion of the phase point. 


When the system is autonomous, one can say quite a lot about the general nature of 
the phase paths without finding them. The basic result is as follows: 


Theorem 8.1 Autonomous systems: a basic result Each point of the phase space 
of an autonomous system has exactly one phase path passing through it. 


Proof. 


Let (a, b) be any point of the phase space. Suppose that the motion of the phase point (x1, x2) 


satisfies the equations (8.18) and that the phase point is at (a, b) when t = 0. The general theory of ODEs 
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then tells us that a solution of the equations (8.18), that satisfies the initial conditions xj = a, x2 = b 
when f = 0, exists and is unique. Let this solution be {X,(t), X2(t)}, which we will suppose is defined 
for all t, both positive and negative. This phase path certainly passes through the point (a, b) and we must 
now show that there is no other. Suppose then that there is another solution of the equations in which the 
phase point is at (a,b) when t = T, say. This motion also exists and is uniquely determined and, in the 
general case, would not be related to {X ; (t), X2(t)}. However, for autonomous systems, the right sides of 
equations (8.18) are independent of t so that the two motions differ only by a shift in the origin of time. To 
be precise, the new motion is simply {X;(t — t), X2(t — t)}. Thus, although the two motions are distinct, 
the two phase points travel along the same path with the second point delayed relative to the first by the 
constant time t. Hence, although there are infinitely many motions of the phase point that pass through the 
point (a, b), they all follow the same path. This proves the theorem. 


Some important deductions follow from this basic result. 


Phase paths of autonomous systems 


e Distinct phase paths of an autonomous system do not cross or touch each other. 


e Periodic motions of an autonomous system correspond to phase paths that are 
simple* closed loops. 


Figure 8.2 showsthe phase paths of an autonomous system. For this system, all of the 
phase paths are simple closed loops and so every motion is periodic. An exception occurs 
if the phase point is started from the point (2, 1). In this case the system has the constant 
solution x; = 2, x2 = 1 so that the phase point never moves; for this reason, the point 
(2, 1) is called an equilibrium point of the system. In this case, the ‘path’ of the phase 
point consists of the single point (2,1). However, this still qualifies as a path and the 
above theory still applies. Consequently no ‘real’ path may pass through an equilibrium 
point of an autonomous system.‘ 


8.3. THE PHASE PLANE IN DYNAMICS ( (x, v)—plane) 


The above theory seems unconnected to classical mechanics since dynamical 
equations of motion are second order ODEs. However, any second order ODE can be 
expressed as a pair of first order ODEs. For example, consider the general linear oscilla- 
tor equation 


CF oe? era (8.19) 
— — a . : 
dt? dt 
If we introduce the new variable v = dx /dt, then 
dv 


+ 2kv + 27x = F(t). 
dt 


* A simple curve is one that does not cross (or touch) itself (except possibly to close). 
* Tt may appear from diagrams that phase paths can pass through equilibrium points. This is not so. Such 
a path approaches arbitrarily close to the equilibrium point in question, but never reaches it! 
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U U U 


FIGURE 8.3 Typical phase paths for the simple harmonic oscillator equation. Left: No damping. 
Centre: Sub-critical damping. Right: Super-critical damping. 


It follows that the second order equation (8.19) is equivalent to the pair of first order 
equations 


= 0, 


dx 
dt 
aS F(t) — 2kv — 92x. 
dt 

We may now apply the theory we have developed to this system of first order ODEs, 
where the phase plane is now the (x, v)-plane. It is clear that driven motion leads to 
a non-autonomous system because of the presence of the explicit time dependence of 
F(t); undriven motion (in which F(t) = 0) leads to an autonomous system. It is also 
clear that equilibrium points in the (x, v)-plane lie on the x-axis and correspond to the 
ordinary equilibrium positions of the particle. 

The form of the phase paths for the undriven SHO equation 


d?x 42K dx 4 0 

——_ — ‘a 

dt? dt 
depends on the parameters K and Q. We could find these paths by the method used in 
Example 8.1, but there is no point in doing so since we have already solved the equation 
explicitly in Chapter 5. For instance, when K = 0, the general solution is given by 


x =Ccos(Qt — y), 


from which it follows that 


dx CQsin(Q 
a aa sin(Qr — y). 
The phase paths in the (x, v)-plane are therefore similar ellipses centred on the origin, 
which is an equilibrium point. This, and two typical cases of damped motion, are shown 
in Figure 8.3. In the presence of damping, the phase point tends to the equilibrium point 
at the origin as f — oo. Although the equilibrium point is never actually reached, it is 
convenient to say that these paths ‘terminate’ at the origin. 
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FIGURE 8.4 The phase diagram for the undamped Duffing equation with a 
softening spring. 


Example 8.2 Phase diagram for equation d?x /dt? + Q?x + Ax? =0 
Sketch the phase diagram for the non-linear oscillation equation 

d?x /dt? + Q?x + Ax? =0, 
when A < 0 (the softening spring). 


Solution 
This equation is equivalent to the pair of first order equations 


dx 
dt 
dv 
dt 


— vy 
= —9*x — Ax?, 
which is an autonomous system. The phase paths satisfy the equation 


dv - Q2x + Ax3 


dx v 


which is a first order separable ODE whose general solution is 
v=C— 22x? — sAx4, 


where C is a constant of integration. Each positive value of C corresponds to a phase 
path. The phase diagram for the case A < 0 is shown in Figure 8.4. There are three 
equilibrium points at (0,0), (4Q/|A|!/*,0). The closed loops around the origin 
correspond to periodic oscillations of the particle about x = 0. Such oscillations can 
therefore exist for any amplitude less than Q/|A|!/2; this confirms the prediction of 
the energy argument used earlier. Outside this region of closed loops, the paths are 
unbounded and correspond to unbounded motions of the particle. These two regions 
of differing behaviour are separated by the dashed paths (known as separatrices) that 


‘terminate’ at the equilibrium points (LQ/|A|!/?, 0). m 
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FIGURE 8.5 The Poincaré—Bendixson theorem. Any bounded phase path of a plane 
autonomous system must either close itself (top left), terminate at an equilibrium 
point (top right), or tend to a limit cycle (normal case bottom left, degenerate case 
bottom right). 


8.4 POINCARE-BENDIXSON THEOREM: LIMIT CYCLES 


In the autonomous systems we have studied so far, those phase paths that are 
bounded either (i) form a closed loop (corresponding to periodic motion), or (ii) “termi- 
nate’ at an equilibrium point (so that the motion dies away). Figure 8.3 shows examples 
of this. The famous Poincaré—Bendixson theorem* which is stated below, says that there 
is just one further possibility. 


Poincaré—Bendixson theorem 
Suppose that a phase path of a plane autonomous system lies in a bounded 
domain of the phase plane for t > 0. Then the path must either 
e close itself, or 
e terminate at an equilibrium point as t + od, or 


@ tend to a limit cycle (or a degenerate limit cycle) as t > oo. 


A proper proof of the theorem is long and difficult (see Coddington & Levinson [9]). 


* After Jules Henri Poincaré (1854-1912) and Ivar Otto Bendixson (1861-1935). The theorem was first 
proved by Poincaré but a more rigorous proof was given later by Bendixson. 
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The third possibility is new and needs explanation. A limit cycle is a periodic motion 
of a special kind. It is isolated in the sense that nearby phase paths are not closed but are 
attracted towards the limit cycle* ; they spiral around it (or inside it) getting ever closer, as 
shown in Figure 8.5 (bottom left). The degenerate limit cycle shown in Figure 8.5 (bottom 
right) is an obscure case in which the limiting curve is not a periodic motion but has one 
or more equilibrium points actually on it. This case is often omitted in the literature, but 
it definitely exists! 


Proving the existence of periodic solutions 


The Poincaré—Bendixson theorem provides a way of proving that a plane autonomous 
system has a periodic solution even when that solution cannot be found explicitly. If a 
phase path can be found that cannot escape from some bounded domain D of the phase 
plane, and if D contains no equilibrium points, then Poincaré—Bendixson implies that the 
phase path must either be a closed loop or tend to a limit cycle. In either case, the system 
must have a periodic solution lying in D. The method is illustrated by the following 
examples. 


Example 8.3 Proving existence of a limit cycle 


Prove that the autonomous system of ODEs 


gax-y—(x?+yx, 
paaty-O'+yy, 


has a limit cycle. 


Solution 
This system clearly has an equilibrium point at the origin x = y = 0, and a little 
algebra shows that there are no others. Although we have not proved this result, it 
is true that any periodic solution (simple closed loop) in the phase plane must have 
an equilibrium point lying inside it. In the present case, it follows that, if a periodic 
solution exists, then it must enclose the origin. This suggests taking the domain D to 
be the annular region between two circles centred on the origin. 

It is convenient to express the system of equations in polar coordinates 7, 0. The 
transformed equations are (see Problem 8.5) 

i XX] + 2% §- X1x2 a 

r r 


where x; =r cos@ and x2 =r sin@. In the present case, the polar equations take the 
simple form 


r=rl—r?), 6=1. 


* This actually describes a stable limit cycle, which is the only kind likely to be observed. 
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These equations can actually be solved explicitly, but, in order to illustrate the method, 
we will make no use of this fact. Let D be the annular domain a < r < b, where 
0 <a < landb > 1. On the circle r = b,* = b(1 — b*) < 0. Thus a phase point 
that starts anywhere on the outer boundary r = b enters the domain D. Similarly, on 
the circle r = a,* = a(1 — a”) > O and soa phase point that starts anywhere on the 
inner boundary r = a also enters the domain D. It follows that any phase path that 
starts in the annular domain D can never leave. Since D is a bounded domain with no 
equilibrium points within it or on its boundaries, it follows from Poincaré—Bendixson 
that any such path must either be a simple closed loop or tend to a limit cycle. In 
either case, the system must have a periodic solution lying in the annulus a <r < b. 

We can say more. Phase paths that begin on either boundary of D enter D and 
can never leave. These phase paths cannot close themselves (that would mean leaving 
D) and so can only tend to a limit cycle. It follows that the system must have (at least 
one) limit cycle lying in the domain D. [The explicit solution shows that the circle 
r = 1 isa limit cycle and that there are no other periodic solutions.] 


Not all examples are as straightforward as the last one. Often, considerable ingenuity 
has to be used to find a suitable domain D. In particular, the boundary of D cannot always 
be composed of circles. Most readers will find our second example rather difficult! 


Example 8.4 Rayleigh’s equation has a limit cycle 


Show that Rayleigh’s equation 
¥+e¢ (#21) +x =0, 


has a limit cycle for any positive value of the parameter e€. 


Solution 

Rayleigh’s equation arose in his theory of the bowing of a violin string. In the context 
of particle oscillations however, it corresponds to a simple harmonic oscillator with a 
strange damping term. When |x| > 1, we have ordinary (positive) damping and the 
motion decays. However, when |x| < 1, we have negative damping and the motion 
grows. The possibility arises then of a periodic motion which is positively damped on 
some parts of its cycle and negatively damped on others. Somewhat surprisingly, this 
actually exists. 

Rayleigh’s equation is equivalent to the autonomous system of ODEs 


x =v 

; , 2 
b= —x—ev(v*—-1), B20) 
for which the only equilibrium position is at x = v = 0. It follows that, if there is a 
periodic solution, then it must enclose the origin. At first, we proceed as in the first 
example. In polar form, the equations (8.20) become 


P= —er sin Oo ( str oy 


I 
. 21 
6 = —1-—esin?6 (r7 sin 29-1). eal) 
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FIGURE 8.6 A suitable domain D to show that Rayleigh’s 
equation has a limit cycle. 


Let r = c be a circle with centre at the origin and radius less than unity. Then? > 0 
everywhere on r = c except at the two points x = +c, v = O, where it is zero. 
Hence, except for these two points, we can deduce that a phase point that starts on the 
circle r = c enters the domainr > c. Fortunately, these exceptional points can be 
disregarded. It does not matter if there are a finite number of points on r = c where 
the phase paths go the ‘wrong’ way, since this provides only a finite number of escape 
routes! The circle r = c thus provides a suitable inner boundary C, of the domain D. 

Sadly, one cannot simply take a large circle to be the outer boundary of D since 
r has the wrong sign on those segments of the circle that lie in the strip —1 < v < 1. 
This allows any number of phase paths to escape and so invalidates our argument. 
However, this does not prevent us from choosing a boundary of a different shape. A 
suitable outer boundary for D is the contour C2 shown in Figure 8.6. This contour is 
made up from four segments. The first segment AB is part of an actual phase path 
of the system which starts at A(—a, 1) and continues as far as B(b, 1). The form 
of this phase path can be deduced from equations (8.21). When v(= rsin@) > 1, 
i <OQand6@ < —1, so that the phase point moves clockwise around the origin with 
r decreasing. In particular, B must be closer to the origin than A so that b < a, as 
shown. Similarly, the segment A’B’ is part of a second actual phase path that begins at 
A'(a, —1). Because of the symmetry of the equations (8.20) under the transformation 
x — —x,v — —v, this segment is just the reflection of the segment AB in the origin; 
the point B’ is therefore (—b, —1). The contour is closed by inserting the straight line 
segments BA’ and B’A. 

We will now show that, when C2 is made sufficiently large, it is a suitable outer 
boundary for our domain D. Consider first the segment AB. Since this is a phase 
path, no other phase path may cross it (in either direction); the same applies to the 
segment A’B’. Now consider the straight segment B A’. Because a > b, the outward 
unit normal nm shown in Figure 8.6 makes a positive acute angle a with the axis Ox. 
Now the ‘phase plane velocity’ of a phase point is 


R46 j = vi - (ev(v? = 1) +2) j 
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wu —-» 
FIGURE 8.7 The body is supported by a - —_> V . 
rough moving belt and is attached to a fixed 


post by a light spring. <—_—. 
and the component of this ‘velocity’ in the n-direction is therefore 
(vi — (cv? —1) +x) i) -(cosai+sina j) 


= vcosa — sina (cow? —-1) +x) 


=—xsina+v (cose +esina(1— *)) 


< —bsina+(1+e), 


for (x, v) on BA’. We wish to say that this expression is negative so that phase points 
that begin on B A’ enter the domain D. This is true if the contour Cz is made large 
enough. If we let a tend to infinity, then 5 also tends to infinity and a tends to 1/2. 
It follows that, whatever the value of the parameter €, we can make bsina > (1+ €) 
by taking a large enough. A similar argument applies to the segment B’A. Thus 
the contour C2 is a suitable outer boundary for the domain D. It follows that any 
phase path that starts in the domain D enclosed by C, and Cz can never leave. Since 
D is a bounded domain with no equilibrium points within it or on its boundaries, it 
follows from Poincaré—Bendixson that any such path must either be a simple closed 
loop or tend to a limit cycle. In either case, Rayleigh’s equation must have a periodic 
solution lying in D. 

We can say more. Phase paths that begin on either of the straight segments of 
the outer boundary C2 enter D and can never leave. These phase paths cannot close 
themselves (that would mean leaving D) and so can only tend to a limit cycle. It 
follows that Rayleigh’s equation must have (at least one) limit cycle lying in the 
domain D. [There is in fact only one.] 


A realistic mechanical system with a limit cycle 


Finding realistic mechanical systems that exhibit limit cycles is not easy. Driven oscilla- 
tions are eliminated by the requirement that the system be autonomous. Undamped oscil- 
lators have bounded periodic motions, and the introduction of damping causes the motions 
to die away to zero, not to a limit cycle. In order to keep the motion going, the system 
needs to be negatively damped for part of the time. This is an unphysical requirement, but 
it can be simulated in a physically realistic system as follows. 

Consider the system shown in Figure 8.7. A block of mass M is supported by a rough 
horizontal belt and is attached to a fixed post by a light linear spring. The belt is made 
to move with constant speed V. Suppose that the motion takes place in a straight line 
and that x(t) is the extension of the spring beyond its natural length at time rt. Then the 
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FIGURE 8.8 Left: The form of the frictional resistance function G(v). Right: The limit cycle 
in the phase plane; E is the unstable equilibrium point. 


equation of motion of the block is 
M— =—-MQ’x- F(v—V), 


where v = dx/dt, MQ? is the spring constant, and F(v) is the frictional force that the 
belt would exert on the block if the block had velocity v and the belt were at rest; in the 
actual situation, the argument v is replaced by the relative velocity v — V. The function 
F (v) is supposed to have the form shown in Figure 8.8 (left). Although this choice is 
unusual (F (v) is not an increasing function of v for all v), it is not unphysical! 

Under the above conditions, the block has an equilibrium position at x = 
F(V)/(MQ?). The linearised equation for small motions near this equlibrium position 
is given by 


d2x!' 
M—x = —MQ?x' — F'(V) 


dx! 
dt’ 


where x’ is the displacement of the block from the equilibrium position. If we select the 
belt velocity V so that F’(V ) is negative (as shown in Figure 8.8 (left)), then the effective 
damping is negative and small motions will grow. The equilibrium position is therefore 
unstable; oscillations of the block about the equlibrium position then do not die out, but 
instead tend to a limit cycle. This limit cycle is shown in Figure 8.8 (right). The formal 
proof that such a limit cycle exists is similar to that for Rayleigh’s equation. Indeed, this 
system is essentially Rayleigh’s model for the bowing of a violin string, where the belt is 
the bow, and the block is the string. 


Chaotic motions 


Another important conclusion from Poincaré—Bendixson is that no bounded motion of a 
plane autonomous system can exhibit chaos. The phase point cannot just wander about 
in a bounded region of the phase plane for ever. It must either close itself, terminate at 
an equilibrium point, or tend to a limit cycle and none of these motions is chaotic. In 
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particular, no bounded motion of an undriven non-linear oscillator can be chaotic. As we 
will see in the next section however, the driven non-linear oscillator (a non-autonomous 
system) can exhibit bounded chaotic motions. 

It should be remembered that Poincaré—Bendixson applies only to the bounded motion 
of plane autonomous systems. If the phase space has dimension three or more, then other 
motions, including chaos, are possible. 


8.5 DRIVEN NON-LINEAR OSCILLATIONS 


Suppose that we now introduce damping and a harmonic driving force into equa- 
tion (8.2). This gives 
ax dx 2 3 
ae ae x + Ax? = Focos pt, (8.22) 
which is known as Duffing’s equation. 

The presence of the driving force Focos pt makes this system non-autonomous. 
The behaviour of non-autonomous systems is considerably more complex than that of 
autonomous systems. Phase space is still a useful aid in depicting the motion of the sys- 
tem, but little can be said about the general behaviour of the phase paths. In particular, 
phase paths can cross each other any number of times, and Poincaré—Bendixson does not 
apply. Our treatment of driven non-linear oscillations is therefore restricted to perturbation 
theory. 

In view of the large number of parameters, it is sensible to non-dimensionalise equa- 
tion (8.22). The dimensionless displacement X is defined by x = (Fo/ px and the 
dimensionless time s by s = pt. The function X (s) then satisfies the dimensionless equa- 
tion 


" k / Q ° 3 
XxX" +(—)]xX'+[([—] X +€X° =coss, (8.23) 
P P 
where the dimensionless parameter € is defined by 
2 
_ Fo 
ee 


When € = QO, equation (8.23) reduces to the linear problem. This suggests that, when 
€ is small, we may be able to find approximate solutions by perturbation theory. The 


(8.24) 


linear problem always has a periodic solution for X (the driven motion) that is harmonic 
with period 27r. Proving the existence of periodic solutions of Duffing’s equation is an 
interesting and difficult problem. Here we address this problem for the case in which € 
is small, a regular perturbation on the linear problem. To simplify the working we will 
suppose that damping is absent; the general features of the solution remain the same. The 
governing equation (8.23) then simplifies to 


2 
XxX" 4 Q ye x3 = 
— eX” =Ccoss. (8.25) 
Dp 
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Initial conditions do not come into this problem. We are simply seeking a family of 
solutions X (s, €), parametrised by e€, that are (i) periodic, and (ii) reduce to the linear 
solution when € = 0. We need to consider first the periodicity of this family of solutions. 
In the non-linear problem, we have no right to suppose that the angular frequency of the 
driven motion is equal to that of the driving force, as it is in the linear problem; it could 
depend on €. However, suppose that the driving force has minimum period To and that a 
family of solutions X (s, €)) of equation (8.25) exists with minimum period t (= T(€)). 
Then, since the derivatives and powers of X also have period T, it follows that the left 
side of equation (8.25) must have period t. The right side however has period to and this 
is known to be the minimum period. It follows that t must be an integer multiple of to; 
note that tT is not compelled to be equal to to.* However, in the present case, the period 
T(€) is supposed to be a continuous function of € with tT = t9 when € = 0. It follows 
that the only possibility is that t = To for all €. Thus the period of the driven motion 
is independent of € and is equal to the period of the driving force. This argument leaves 
open the possibility that other driven motions may exist that have periods that are integer 
multiples of t9. However, even if they exist, they cannot occur in our perturbation scheme. 

We therefore expand X (s, €) in the perturbation series 


X(t) = Xo) F+eX1D FEXAN ++, (8.26) 


and seek a solution of equation (8.25) that has period 277. It follows that the expansion 
functions Xo(s), X1(s), X2(s), ... must also have period 27. If we now substitute this 
series into the equation (8.25) and equate coefficients of powers of €, we obtain a succes- 
sion of ODEs the first two of which are as follows: 


From coefficients of €9: 


2 
xX + e Xo= 8.27 
0 . 0 =Ccoss. (8.27) 
From coefficients of €!: 
2 
pues (=) y=S=k, (8.28) 
il Pp 1 = 0: . 


For p # Q, the general solution of the zero order equation (8.27) is 


2 
Xo= (es) cos s + A cos(Qs/p) + B sin(Qs/p), 
Q° —p 


where A and B are arbitrary constants. Since Xo is known to have period 277, it follows 
that A and B must be zero unless Q2 is an integer multiple of p; we will assume this is not 


* The fact that t is the minimum period of X does not neccessarily make it the minimum period of the left 
side of equation (8.25). 
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the case. Then the required solution of the zero order equation is 


2 


A= 2 — p2 


COS S. (8.29) 


The first order equation (8.28) can now be written 


3 


Q\? 
x{+(2) xXy=- a cos? s 
Pp —?p 


6 
= ea pp (3. cos s + cos3s), (8.30) 


on using the trigonometric identity cos 3s = 4 cos’ s—3 cos s. Since Q/p is not an integer, 
the only solution of this equation that has period 27 is 


8 
Dp 3 cos s cos 3s 
Me 8.31 

; 4(22 — p23 (= =p oe 2) — 


Results 


When € (= FRA / p®) is small, the driven response of the Duffing equation (8.22) (with 
k = 0) is given by 


Fo : 3p® cos pt p® cos 3pt af) ( *) 
x = ——~ ]} co 
OF — pt [PPE \ CP p35 © (AF pr2(Q? —9py JET 
(8.32) 


This is the approximate solution correct to the first order in the small parameter €. 
More terms can be obtained in a similar way but this is best done with computer assistance. 

The most interesting feature of this formula is the behaviour of the first order cor- 
rection term when Q is close to 3p, which suggests the existence of a super-harmonic 
resonance with frequency 3p. Similar ‘resonances’ occur in the higher terms at the fre- 
quencies 5p, 7p, ..., and are caused by the presence of the non-linear term A.x?. It should 
not however be concluded that large amplitude responses occur at these frequencies.* The 
critical case in which Q = 3p is solved in Problem 8.14 and reveals no infinities in the 
response. 


Sub-harmonic responses and chaos 


We have so far left open the interesting question of whether a driving force with minimum 
period t can excite a subharmonic response, that is, a response whose minimum period is 


* This is a subtle point. Like all power series, perturbation series have a certain ‘radius of convergence’. 
When ail the terms of the perturbation series are included, € is resticted to some range of values —eg < 
€ < €g. What seems to happen when Q approaches 3p is that €9 approaches zero so that the first order 
correction term never actually gets large. 
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FIGURE 8.9 Two different periodic responses to the same driving force. Left: A 
response of period 27, Right: A sub-harmonic response of period 477. 


an integer multiple of t. This is certainly not possible in the linear case, where the driving 
force and the induced response always have the same period. One way of investigating 
this problem would be to expand the (unknown) response x(t) as a Fourier series and 
to substitute this into the left side of Duffing’s equation. One would then require all the 
odd numbered terms to magically cancel out leaving a function with period 2t. Unlikely 
though this may seem, it can happen! There are ranges of the parameters in Duffing’s 
equation that permit a sub-harmonic response. Indeed, it is possible for the same set 
of parameters to allow more than one periodic response. Figure 8.9 shows two different 
periodic responses of the equation d*x /dt? + kdx /dt +x* = Acost, each corresponding 
to k = 0.04, A = 0.9. One response has period 27 while the other is a subharmonic 
response with period 477. Which of these is the steady state response depends on the 
initial conditions. It is also possible for the motion to be chaotic with no steady state ever 
being reached, even though damping is present. 


Problems on Chapter 8 


Answers and comments are at the end of the book. 


Harder problems carry a star (*). 


Periodic oscillations: Lindstedt’s method 
8.1 A non-linear oscillator satisfies the equation 

(1 + ex?) ¥4x = 0, 
where € is a small parameter. Use Linstedt’s method to obtain a two-term approximation to 
the oscillation frequency when the oscillation has unit amplitude. Find also the corresponding 
two-term approximation to x(t). [You will need the identity 4 cos? s = 3coss + cos 3s.] 


8.2 A non-linear oscillator satisfies the equation 


X+x+ex=0, 
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where € is a small parameter. Use Linstedt’s method to obtain a two-term approximation to 
the oscillation frequency when the oscillation has unit amplitude. [You will need the identity 
16cos*s = 10coss + 5cos 3s + cos5s.] 


8.3 Unsymmetrical oscillations A non-linear oscillator satisfies the equation 
ss : De 
X¥+x+ex* =0, 


where € is a small parameter. Explain why the oscillations are unsymmetrical about x = 0 in 
this problem. 

Use Linstedt’s method to obtain a two-term approximation to x(t) for the oscillation in 
which the maximum value of x is unity. Deduce a two-term approximation to the minimum 
value achieved by x(t) in this oscillation. 


8.4% A limit cycle by perturbation theory Use perturbation theory to investigate the limit 
cycle of Rayleigh’s equation, taken here in the form 


¥+e(4%?-1)et+x=0, 


where € is a small positive parameter. Show that the zero order approximation to the limit 
cycle is a circle and determine its centre and radius. Find the frequency of the limit cycle 
correct to order €?, and find the function x(t) correct to order €. 


Phase paths 
8.5 Phase paths in polar form Show that the system of equations 
x1 = Fy(x, x2, 0), X2 = F2(x1, x2, 0) 


can be written in polar coordinates in the form 


X{F\ + x2F2 6- Xx Fy — x2F| 


1 : r2 ; 


where x} =r cos@ and x2 =r sing. 
A dynamical system satisfies the equations 


xX=—x4+y, 
pei): 


Convert this system into polar form and find the polar equations of the phase paths. Show that 
every phase path encircles the origin infinitely many times in the clockwise direction. Show 
further that every phase path terminates at the origin. Sketch the phase diagram. 


8.6 A dynamical system satisfies the equations 


t=x-y—(?+y’)x, 
paxty-@+yy. 
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Convert this system into polar form and find the polar equations of the phase paths that begin 
in the domain 0 < r < 1. Show that all these phase paths spiral anti-clockwise and tend to 
the limit cycle r = 1. Show also that the same is true for phase paths that begin in the domain 
r > 1. Sketch the phase diagram. 


8.7 A damped linear oscillator satisfies the equation 
¥+x+x=0. 


Show that the polar equations for the motion of the phase points are 


* = —r sin? 6, 6 = — (1+ 5sin26). 


Show that every phase path encircles the origin infinitely many times in the clockwise direc- 
tion. Show further that these phase paths terminate at the origin. 


8.8 A non-linear oscillator satisfies the equation 
as +3 = 
X¥4+x°+x=0. 


Find the polar equations for the motion of the phase points. Show that phase paths that begin 
within the circle r < 1 encircle the origin infinitely many times in the clockwise direction. 
Show further that these phase paths terminate at the origin. 


8.9 A non-linear oscillator satisfies the equation 
i+ (7 +2?- 1) i +x=0. 


Find the polar equations for the motion of the phase points. Show that any phase path that 
starts in the domain 1 < r < V3 spirals clockwise and tends to the limit cycle r = 1. [The 
same is true of phase paths that start in the domain 0 <r < 1.] What is the period of the limit 
cycle? 


8.10 Predator—prey Consider the symmetrical predator-prey equations 


x=xX-—xy, y=xy-y, 


where x(t) and y(t) are positive functions. Show that the phase paths satisfy the equation 
(ve) (ve) = A, 


where A is a constant whose value determines the particular phase path. By considering the 
shape of the surface 


= (xe) (ye), 


deduce that each phase path is a simple closed curve that encircles the equilibrium point at 
(1, 1). Hence every solution of the equations is periodic! [This prediction can be confirmed 
by solving the original equations numerically. ] 
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Poincaré—Bendixson 


8.11 Use Poincaré—Bendixson to show that the system 


gox—y— (x2 44y?)x, 
yaxty—(? +4yy, 


has a limit cycle lying in the annulus 5 <r<l. 
8.12 Van der Pol’s equation Use Poincaré—Bendixson to show that Van der Pol’s equation* 
KX +€X (x? - 1)+x=0, 


has a limit cycle for any positive value of the constant ¢. [The method is similar to that used 
for Rayleigh’s equation in Example 8.4.] 


Driven oscillations 
8.13 A driven non-linear oscillator satisfies the equation 
X + ex? +.x = cos pt, 


where €, p are positive constants. Use perturbation theory to find a two-term approximation 
to the driven response when € is small. Are there any restrictions on the value of p? 


8.14 Super-harmonic resonance A driven non-linear oscillator satisfies the equation 
€4+9x+exr°= cost, 


where € is a small parameter. Use perturbation theory to investigate the possible existence of 
a superharmonic resonance. Show that the zero order solution is 


1 
x= 8 (cost + dg cos 3r), 
where the constant ao is a constant that is not known at the zero order stage. 
By proceeding to the first order stage, show that ap is the unique real root of the cubic 
equation 


3a§ + 6ay + 1 = 0, 


which is about —0.164. Thus, when driving the oscillator at this sub-harmonic frequency, the 
non-linear correction appears in the zero order solution. However, there are no infinities to be 
found in the perturbation scheme at this (or any other) stage. 

Plot the graph of xo(t) and the path of the phase point (xo(f), xt): 


* After the extravagantly named Dutch physicist Balthasar Van der Pol (1889-1959). The equation arose 
in connection with the current in an electronic circuit. In 1927 Van der Pol observed what is now called 
deterministic chaos, but did not investigate it further. 
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Computer assisted problems 
8.15 Linstedt’s method Use computer assistance to implement Lindstedt’s method for the 
equation 


K$+xtex27=0. 


Obtain a three-term approximation to the oscillation frequency when the oscillation has unit 
amplitude. Find also the corresponding three-term approximation to x(f). 


8.16 Van der Pol’s equation A classic non-linear oscillation equation that has a limit cycle 
is Van der Pol’s equation 


¥+e(x?-1)e+x=0, 


where € is a positive parameter. Solve the equation numerically with € = 2 (say) and plot the 
motion of a few of the phase points in the (x, v)-plane. All the phase paths tend to the limit 
cycle. One can see the same effect in a different way by plotting the solution function x(t) 
against ¢. 


8.17 Sub-harmonic and chaotic responses Investigate the steady state responses of the 
equation 


K+kx +x? = Acost 
for various choices of the parameters k and A and various initial conditions. First obtain the 


responses shown in Figure 8.9 and then go on to try other choices of the parameters. Some 
very exotic results can be obtained! For various chaotic responses try K = 0.1 and A = 7. 
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MULTI-PARTICLE SYSTEMS 
AND CONSERVATION PRINCIPLES 


CHAPTERS IN PART TWO 


Chapter9 The energy principle 
Chapter 10 The linear momentum principle 


Chapter 11 The angular momentum principle 


Chapter Nine 


The energy principle 


and energy conservation 


KEY FEATURES 
The key features of this chapter are the energy principle for a multi-particle system, the poten- 
tial energies arising from external and internal forces, and energy conservation. 


This is the first of three chapters in which we study the mechanics of multi-particle 
systems. This is an important development which greatly increases the range of problems 
that we can solve. In particular, multi-particle mechanics is needed to solve problems 
involving the rotation of rigid bodies. 

The chapter begins by obtaining the energy principle for a multi-particle system. This 
is the first of the three great principles of multi-particle mechanics* that apply to every 
mechanical system without restriction. We then show that, under appropriate conditions, 
the total energy of the system is conserved. We apply this energy conservation principle 
to a wide variety of systems. When the system has just one degree of freedom, the energy 
conservation equation is sufficient to determine the whole motion. 


9.1 CONFIGURATIONS AND DEGREES OF FREEDOM 


A multi-particle system S may consist of any number of particles P;, P2,..., 


Py, with masses m1, m2..., my respectively.’ A possible ‘position’ of the system is 
called a configuration. More precisely, if the particles P;}, P2,..., Py of a system have 
position vectors r1, r2,..., “Nn, then any geometrically possible set of values for the posi- 


tion vectors {7;} is a configuration of the system. 

If the system is unconstrained, then each particle can take up any position in space 
(independently of the others) and all choices of the {r;} are possible. This would be the 
case, for instance, if the particles of S were moving freely under their mutual gravita- 
tion. On the other hand, when constraints are present, the {r;} are restricted. Suppose 
for instance that the particles P; and P> are connected by a light rigid rod of length a. 


* The other two are the linear momentum and angular momentum principles. 
¥ To save space, we will usually express this by saying that S is the system of particles {P;} with masses 
{m;}, the range of the index number 7 being understood to be 1 <i < N. 
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FIGURE 9.1 The multi-particle system S e P 

consists of N particles P;, P2,..., Py, of e 7 e 

which the typical particle P; is labelled. The 

particle P; has mass m;, position vector rj, O ee a es as 
and velocity v;. a vU; 


FIGURE 9.2 The generalised coordinates x O \@ Qa 

and @ are sufficient to specify the 

configuration of this two-particle system in | P 
planar motion. | 2 


This imposes the geometrical restriction |r; — r2| = a so that not all choices of the {7r;} 
are then possible. This difference is reflected in the number of scalar variables needed 
to specify the configuration of S. In the unconstrained case, all of the position vectors 
{r;} must be specified separately. Since each of these vectors may be specified by three 
Cartesian coordinates, it follows that a total of 3N scalar variables are needed to specify 
the configuration of an unconstrained N -particle system. When constraints are present, 
this number is reduced, often dramatically so. 

For example, consider the system shown in Figure 9.2, which consists of two particles 
P, and P> connected by a light rigid rod of length a. The particle P; is also constrained 
to move along a fixed horizontal rail and the whole system moves in the vertical plane 
through the rail. The two scalar variables x and 6 shown are sufficient to specify the 
configuration of this system. This contrasts with the six scalar variables that would be 
needed if the two particles were in unconstrained motion. The variables x and 6 are said 
to be a set of generalised coordinates for this system.* Other choices for the generalised 
coordinates could be made, but the number of generalised coordinates needed is always 
the same. 


Definition 9.1 Degrees of freedom The number of generalised coordinates needed to 
specify the configuration of a system S is called the number of degrees of freedom of S. 


Importance of degrees of freedom 


The number of degrees of freedom of a system is important because it is equal to the 
number of equations that are needed to determine the motion of the system. For example, 


* Besides being sufficient to specify the configuration of the system, the generalised coordinates are also 
required to be independent, that is, there must be no functional relation between them. The coordinates 
x, 6 in Figure 9.2 are certainly independent variables. If the coordinates were connected by a functional 
relation, they would not all be needed and one of them could be discarded. 
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FIGURE 9.3 The multi-particle S consists of 
N particles P|, P2,..., Py, of which the 
typical particles P; and P; are shown 
explicitly. The force F; is the external force 
acting on P; and the force G;; is the internal 
force exerted on P; by the particle P;. 


the system shown in Figure 9.2 has two degrees of freedom and so needs two equations to 
determine the motion completely. 


Example 9.1 Degrees of freedom 


Find the number of degrees of freedom of the following mechanical systems: (i) the 
simple pendulum (moving in a vertical plane), (ii) a door swinging on its hinges, (iii) 
a bar of soap (a particle) sliding on the inside of a hemispherical basin, (iv) a rigid 
rod sliding on a flat table, (v) four rigid rods flexibly jointed to form a quadrilateral 
which can slide on a flat table. 


Solution 
(i) 1 (i)1 Gi) 2 (Gv)3 (Vv) 4.9 


9.2 THE ENERGY PRINCIPLE FOR A SYSTEM 


Let S be a system of N particles {P;}, as shown in Figure 9.3. We classify the 
forces acting on the particles of S as being external or internal. External forces are those 
originating from outside S. (In the case of a single particle, these are the only forces 
that act.) Uniform gravity is an example of an external force. However, in multi-particle 
systems, the particles are also subject to their own mutual interactions, that is, the forces 
that they exert upon each other. These mutual interactions are called the internal forces 
acting on S. The situation is shown in Figure 9.3. F; is the external force acting on the 
particle P;, while G;; is the internal force exerted on P; by the particle P;. By the Third 
Law, the force G ;; that P; exerts on P; must be equal and opposite to the force G;;, and 
both forces must be parallel to the straight line joining P; and P;. In short, the {G;;} must 
satisfy 


G ji = —Gij;, and Gi; | (rj = rj). (9.1) 
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To obtain the energy principle for the system S, we proceed in the same way as we did for 
a single particle in section 6.1. The equation of motion for the particle P; is* 


dv; a 
mj— = Fi +) Gij, (9.2) 


where v; is the velocity of P; at time ¢. On taking the scalar product of both sides of 
equation (9.2) with v; and then summing the result over all the particles (1 <i < N), we 
obtain 


N N 


“=> Pit DG: Uj, (9.3) 


i=1 i= 


where 


N 


1 2 
T= )0 smilvjl, 


i=l 


the total kinetic energy of the whole system S. Suppose that, in the time interval [t, tg], 
the system S moves from configuration A to configuration 6. On integrating equation 
(9.3) with respect to ¢ over the time interval [t.4, fg] we obtain 


Tg —Ta = vf" at Dd |" ij + vj dt (9.4) 


i=1 i=l j=1 


where 7.4 and Ty are the kinetic energies of the system S at times t4 and tg respectively. 
This is the energy principle for a multi-particle system moving under the external forces 
{F;} and internal forces {G;;}. This impressive looking result can be stated quite simply 
as follows: 


Energy principle for a multi-particle system 


In any motion of a system, the increase in the total kinetic energy of the system in 
a given time interval is equal to the total work done by all the external and internal 
forces during this time interval. 


* The summation over j in equation (9.2) contains the term G;; which corresponds to the force that the 
particle P; exerts upon itse/f. Since such a force is not actually present, we should really say that the 
summation is over the range | < 7 < N with 7 #7. Since this would make the formulae look messy, we 
adopt the device of regarding the terms G11, G22, ..., Gyy (which do not actually exist) as being zero. 
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9.3 ENERGY CONSERVATION FOR A SYSTEM 


In order to develop an energy conservation principle, we need to write the right 
side of the energy principle (9.4) in the form V(A) — V(B), where V is the potential 
energy function for the whole system. We first consider unconstrained systems. 


Unconstrained systems 


When the system is unconstrained, all the forces that act on the system are specified 
directly. We will assume that the external forces F; are conservative fields. In this case 
F; = — grad ¢;, where ¢; is the potential energy function of the field F;. Then the total 
work done by the external forces can be written 


N tg 


N 
Df Fi-wat = Yours) ~ dlrs) = OH - OB), 
tA 


i=1 


i= 


where 
P(r1,7r2,...,7n) = b1(r1) + b2(r2) +--+ + on (rN) 
is the potential energy of S arising from the external forces. 


Example 9.2 Potential energy under uniform gravity 


Find the potential energy ® when the external forces on S arise from uniform gravity. 


Solution 

Under uniform gravity, the force F; exerted on particle P; is F; = —m;gk, where the 
unit vector k points vertically upwards. This conservative field has potential energy 
oj = m;g2z;, where z; is the z-coordinate of P;. The total potential energy of S due to 
uniform gravity is therefore 


D = m1 87; +m2g272 + +--+ MNngzZN. 


On using the definition of centre of mass given in section 3.5, this can be written in 
the alternative form 


© = MgZ, 


where M is the total mass of S, and Z is the z-cordinate of the centre of mass of S. 
Thus the potential energy of any system due to uniform gravity is the same as if all its 
mass were concentrated at its centre of mass. 


We now need to make a similar transformation to show that the work done by the 
internal forces can be written in the form (A) — Y(B), where W is the internal potential 
energy. The argument is as follows: 


We know from the Third Law that the {G;;} satisfy the conditions (9.1), but a little more must be 
assumed. We further assume that the magnitude of G;; depends only on r;;, the distance between P; and 
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P;.* Internal forces that satisfy this conditions will be called conservative; mutual gravitation forces are a 
typical example. Hence, when the internal forces are conservative, G;; must have the form 


Gjj = hij (rip) Ti; (9.5) 
where (see Figure 9.3) 
ryyp=ri-rj rij = Irij| Tij — rjj/Tij- (9.6) 


Note that hj; is the repulsive force that the particles P; and P; exert upon each other. 
Consider now the rate of working of the pair of forces G; jj and G ;;. This is 


drjj _ (=) a drjj 
= fs 


Gij + vj + Gjj +0; Gal yy —v i) = by Ty* 


dt rij dt 


= hij (rij) ae 


on using equations (9.1), (9.6) and the identity rj; + 7;; = r;jrj;. The total work done by the forces G;; 
and G ji during the time interval [t_,, tj] is therefore 


'B drij rij(B) 
Po maeiy Gar = PO migtoipr any = Hijri AD — Hy ij BD. 
tA rij(A) 
where H,;; is the indefinite integral of —h;;. The function Hj ;(r;;) is called the mutual potential energy 
of the particles P; and P;. 

It follows that the aa work done by all the internal forces in the time interval [74, tj] can be written 
in the form 


a Gij vj dt = V(A) — VB), 


i=1 j=1 
where 
N i-1 
Wrq.r2.--tN) =>, >, Hy (is) 
i=1 j=1 


is the potential energy of S arising from the internal forces. This potential energy is just the sum of the 


mutual potential energies of all pairs of particles. 


Example 9.3 Internal energy of three charged particles 


Three particles P;, P2, P3 carry electric charges e;, e2, e3 respectively. Find the 
internal potential energy WV. 


Solution 

In cgs/electrostatic units, the particles P; and P2 repel each other with the force 
hy2(r12) = e1e2/ (12), where 712 is the distance between P; and P2. Their mutual 
potential energy is therefore 


e1e2 e1e2 
Ay) = - f mornddra =- | — aan = —. 
(r12) r12 


* This is equivalent to the very reasonable assumptions that the magnitude of G;; is invariant under spatial 
translations and rotations of each pair of particles P; and P;, and is independent of the time. 
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The internal potential energy of the whole system is therefore 


e1e2  e1€3— &2€3 
v= + + | 
r12 P13 '23 


On combining the above results, the energy principle (9.4) can be written 
Tg —Ta = V(A)—V(B), 


where V = ®+ W is the total potential energy of the system S. This is equivalent to the 
energy conservation formula 


T+V=E (9.7) 


where EF is the total energy of the system.This result can be summarised as follows: 


Energy conservation for an unconstrained system 


When both the external and internal forces acting on a system are conservative, the 
sum of its kinetic and potential energies* remains constant in the motion. 


Example 9.4 A star with two planets 


A star of very large mass M is orbited by two planets P; and Pz of masses m, and 
mz. Find the energy conservation equation for this system. 


Solution 

Since the mass of the star is supposed to be very much larger than the planetary 
masses, we will neglect its motion and suppose that it is fixed at the origin O. We 
then have a two-particle problem in which the planets move under the (external) grav- 
itational attraction of the star and their (internal) mutual gravitational interaction. This 
is an unconstrained system. 

The total potential energy arising from external forces is then 
MmG Mm2G 


® ama ’ 
T1 r2 


where r1,/"2 are the distances O P|, O Po. 

The particles P; and Pp» repel each other with the force hj2(712) = 
—m\m2G/ (r12)?, where 712 is the distance between P; and P2. Their mutual poten- 
tial energy is therefore 


mim2G mim2G 
——_ ar}. = — ——, 
(r12)? r12 


and this is the only contribution to the internal potential energy V. 


Ay = -fn 2(ri2)driy2 = 


* The potential energy is the total of the potential energies arising from both the external and internal forces. 
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Since the system is unconstrained and the external and internal forces are con- 
servative, energy conservation applies. The energy conservation equation for the 
system is 


1ny |v1|2 + 4m |v2|2 — MG (= + “2) TOMY oF 
ry r2 ri2 
where v1, V2 are the velocities of the planets P;, P2, and E is the constant total energy. 
The value of E is determined from the initial conditions. 
Since this system has six degrees of freedom (four if the motions are confined 
to a plane through O), the energy conservation equation is by no means sufficient to 
determine the motion! @ 


Question Cana planet escape? 


If the initial conditions are such that E < 0, is it possible for a planet to escape to 
infinity? 


Answer 


If E < 0, then it is certainly not possible for both planets to escape to infinity, since 
the total energy would then be positive. However, the escape of one planet is not 
prohibited by energy conservation. This does not mean however that such an escape 
will actually happen. 


Constrained systems 


When a system is subject to constraints, not all the forces that act on the system are speci- 


fied. This is because constraints are enforced by constraint forces that are not part of the 


specification of the problem; all we know is that their effect is to enforce the given con- 


straints. The work done by constraint forces cannot generally be calculated (or expressed 


in terms of a potential energy) and we are restricted to those systems for which the total 


work done by the constraint forces happens to be zero.” 


The constraint forces acting on the system may be external (for example, when a 


particle of the system is constrained to remain at rest), or internal (for example, when two 


particles of the system are constrained to remain the same distance apart). 


A 


B 


The list of external constraint forces that do no work is the same as that given in 
Section 6.5 for single particle motion. 

The most important result regarding internal constraint forces that do no work is 
this: The total work done by any pair of mutual interaction forces is zero when 
the particles on which they act are constrained to remain a fixed distance apart. 
The proof is as follows: 

Suppose two particles P; and P; are constrained to remain a fixed distance apart and that their 


mutual interaction forces are G;; and G j; (see Figure 9.3). Since the distance between P; and P; 
is constant, it follows that (r; — rj) + (rj — rj) is constant, which, on differentiating with respect 


* Individual constraint forces may do work. 
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FIGURE 9.4 The particles Q and R slide along a smooth horizontal rail 
while the particle P moves vertically. 


to f, gives 
(rj —1rj)+ (vj — vj) =0. 


Thus the vector (vj — v ;) must be perpendicular to the straight line joining P; and P;. Hence, 
the rate of working of the two forces G;; and G ;; is 


Gij -vj + Gii -vp= Gij . (v; = -v;) = 0, 
since G;; is known to be parallel to the straight line joining P; and P;. Thus the internal con- 
straint forces Gj; and G ;; do no work in total. 


It follows, for example, that the two tension forces exerted by a light inextensible 
string do no work in total. It further follows that the internal forces that enforce 
rigidity in a rigid body do no work in total. This important result allows us to 
solve rigid body problems by energy methods. 


Our result for constrained systems can be summarised as follows: 


Energy conservation for a constrained system 


When the specified external and internal forces acting on a system are conservative, 
and the constraint forces do no work in total, the sum of the kinetic and potential 
energies of the system remains constant in the motion. 


Example 9.5 A constrained three-particle system 


Figure 9.4 shows a ball P of mass 2m suspended by light inextensible strings of 
length a from two sliders Q and R, each of mass m, which can move on a smooth 
horizontal rail. The system moves symmetrically so that O, the mid-point of Q and 
R, remains fixed and P moves on the downward vertical through O. Initially the 
system is released from rest with the three particles in a straight line and with the 
strings taut. Find the energy conservation equation for the system. 


Solution 
This is a system with one degree of freedom and we take the angle 6 as the generalised 
coordinate. Let z and x be the displacements of the particles P and Q from the fixed 
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point O. Then, in terms of the generalised coordinate 0,.x = acos@ and z =asiné@. 
Differentiating these formulae with respect to ¢ then gives 


x = —(asin0)6, z= (acos0)6. 


Hence the total kinetic energy of the system is given by 


T= 3 (2m)z” + Smx? + tmx? = ma’6’. 


The only contribution to the potential energy comes from uniform gravity, so 
that 


V = —(2m)gz+0+0= —2mgasin6, 


where we have taken the zero level of potential energy to be at the rail. 

We must now show that the constraint forces do no work. The reactions exerted 
by the smooth rail on the particles Q and R are perpendicular to the rail and therefore 
perpendicular to the velocities of Q and R; these reactions therefore do no work. Also, 
the tension forces exerted by the inextensible strings do no work in total. Hence, the 
constraint forces do no work in total. 

Energy conservation therefore applies in the form 


ma?6* — 2mga sind = E. 


From the initial conditions @ = 6 = 0 whent = 0, it follows that E = 0. The energy 
conservation equation for the system is therefore 


. 2 
6? sing = 0." 
a 
Question When do the sliders collide? 
Find the time that elapses before the sliders collide. 


Answer 


Since this system has only one degree of freedom, the motion can be found from 
energy conservation alone. From the energy conservation equation, it follows that 


do 2" 
ee (=) (sina)!/?, 
dt a 


and, since @ is an increasing function of t, we take the positive sign. This equation is 
a first order separable ODE. 
Since the sliders collide when 6 = 7/2, the time t that elapses is given by 


a\'/2 pri2 go a\ i? 
t=(— i ——__ + 1.85(-) 
28 o — (sin d)!/2 g 


9.3. Energy conservation for a system 231 


Initially 


FIGURE 9.5 A uniform rope is released from rest hanging over the edge of a smooth table (left). 
After time rf it has displacement x (right). 


Example 9.6 Rope sliding off a table 


A uniform inextensible rope of mass M and length a is released from rest hanging 
over the edge of a smooth horizontal table, as shown in Figure 9.5. Find the speed of 
the rope when it has the displacement x shown. 


Solution 
A rope is a continuous distribution of mass, unlike the discrete masses that appear in 
our theory. We regard the rope as being represented by a /ight inextensible string of 
length a with N particles, each of mass M/N, attached to the string at equally spaced 
intervals along its length. When N is very large, we expect this discrete set of masses 
to approximate the behaviour of the rope. 

Since each particle of the rope has the same speed v (= x), the total kinetic 
energy of the rope is simply 


The only contribution to the potential energy comes from uniform gravity. If we 
take the reference state for V to be the initial configuration (Figure 9.5 (left)), then 
the potential energy in the displaced configuration (right) is the same as if a length x 
of the rope lying on the table were cut off and this piece were then suspended from 
the hanging end. In the continuous limit (that is, as N — oo), this piece of rope has 
mass Mx/a and its centre of mass is lowered a distance b + (x /2) by this operation. 
The potential energy of the rope in the displaced configuration is therefore 


va—(“)e (b+). 


We must now show that the constraint forces do no work. The reactions exerted 
by the smooth table on the particles of the rope are always perpendicular to the veloc- 
ities of these particles; these reactions therefore do no work. Also, the tension forces 
exerted by each segment of the inextensible string (connecting adjacent particles of 
the rope) do no work in total. Hence, the constraint forces do no work in total. 

Energy conservation therefore applies in the form 


5Mv — (“*) (6+ 4s) =E. 
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The initial condition v = 0 when x = 0 implies that E = 0. The energy equation for 
the rope is therefore 


vy? = 2 x(x + 2b). 
a 


This gives the speed of the rope when it has displacement x. This formula holds while 
there is still some rope left on the the table top. & 


Note. In the above solution we have assumed that the rope follows the contour of the 
table edge and then falls vertically. However, it can be shown that this cannot be true 
when the rope is close to leaving the table. What actually happens is that the end of 
the rope overshoots the table edge. This is a tricky point which we will not investigate 
further. 


Question Displacement at time t 


Find the displacement of the rope at time f. 


Answer 


Since this system has only one degree of freedom, the motion can be found from 
energy conservation alone. From the energy conservation equation, it follows that 


dx 


ae = -tn xx + apy. 


2 


where n~ = g/a. Since x is an increasing function of t, we take the positive sign. 


This equation is a first order separable ODE. 
It follows that 


, / dx 
nt = oo 
x1/2(x + 2p) 1/2 


=a (=) rar a 
~ 2b , 


on using the substitution x = 2b sinh* w. The initial condition x = 0 when t = 0 
implies that C = 0 and, after some simplification, we obtain 


x = b(coshnt — 1) 


as the displacement of the rope after time ¢. As before, this formula holds while there 
is still some rope left on the the table top. @ 


Example 9.7 Stability of a plank on a log 


A uniform thin rigid plank is placed on top of a rough circular log and can roll without 
slipping. Show that the equilibrium position, in which the plank rests symmetrically 
on top of the log, is stable. 
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FIGURE 9.6 A thin uniform plank is placed 
symmetrically on top of a fixed rough 
circular log. Is the equilibrium position of the 
plank stable? 


9.4 


Solution 

Suppose that the plank is disturbed from its equilibrium position and is tilted by an 
angle 6 as shown in Figure 9.6. The plank is known to ro// on the log, which means 
that the distance GC from the centre G of the plank to the contact point C must 
always be equal to the arc length of the log that has been traversed. If the radius of 
the log is a, then this arc length is ad. 

We are not yet able to calculate the kinetic energy of the plank in terms of the 
coordinate 0. This is done in the next section. However, we do not need it to investi- 
gate stability. 

The only contribution to the potential energy of the plank comes from uniform 
gravity. This is given by V = MgZ, where Z is the vertical displacement of the 
centre of mass G of the plank. Elementary trigonometry (see Figure 9.6) shows that 
Z =acosé+ a0 sin@ — a, so that 


V = Mga(cos@ + @siné — 1). 


We must now show that the constraint forces do no work. The rate of working 
of the constraint force R that the log exerts on the plank is R - v©, where v© is the 
velocity of the particle C of the plank that is instantaneously in contact with the log. 
But, since the plank rolls on the log, v© = 0 so that the rate of working of R is zero. 
Also, the internal constraint forces that enforce the rigidity of the plank do no work 
in total. Hence, the constraint forces do no work in total. 

Energy conservation therefore applies in the form 


T + Mga(cos6+6sin@ —1)=E. 


It follows that the equilibrium position (with the plank on top of the log) will be stable 
if V has a minimum at 6 = 0. Now V’ = Mgaé cos@ and V" = Mga(cos6—6 sin6) 
so that, when 6 = 0, V’ = 0 and V” = 1. Hence V has a minimum at 6 = 0 and so 
the equilibrium position is stable. ™ 


KINETIC ENERGY OF A RIGID BODY 


The general theory we have presented applies to any multi-particle system; in 


particular, it applies to the rigid array of particles that we call a rigid body. However, in 
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FIGURE 9.7 The rigid body 6 rotates about 
the fixed axis C D with angular velocity w. A 
typical particle P; moves on the circular path 
shown. 


order to make use of energy conservation in rigid body dynamics, we need to be able to 
express the kinetic energy T of the body in terms of the generalised coordinates. 


Rigid body with a fixed axis 


Figure 9.7 shows a rigid body B which is rotating about the fixed axis C D. (Imagine that 
the body is penetrated by a thin light spindle, which is smoothly pivoted in a fixed posi- 
tion.) A typical particle P; of the body can move on the circular path shown. This circle 
has radius p;, where p; is the perpendicular distance of P; from the axis CD. Suppose 
that, at some instant, the angular velocity of 6 about the axis CD is w. Then the speed of 
particle P; at this instant is |w|p;, and its kinetic energy is sm i (wpi)?. The total kinetic 
energy of B is therefore 


N N 
ra, (Si (wpi)”) =4 (dm r*) wo. 


i=1 i=l 


Definition 9.2 Moment of inertia The quantity 


N 
Iep = mipir (9.8) 


i=l 


where p; is the perpendicular distance of the mass m; from the axis CD, is called the 
moment of inertia of the body B about the axis CD. 


The moment of inertia, as defined above, does not depend on the motion of the body 
B. It is a purely geometrical quantity (like centre of mass), which describes how the mass 
in G is distributed relative to the axis C D. The further the mass in G lies from the axis, the 
larger is the moment of inertia of 6 about that axis. In the theory of rotating rigid bodies, 
the moment of inertia plays a similar réle to that played by mass in the translational motion 
of a particle. 

Our result may be summarised as follows: 
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Kinetic energy of a rigid body with a fixed axis 


Suppose the rigid body B is rotating about the fixed axis CD with angular velocity 
w. Then the kinetic energy of B is given by 


T=tIcpo’, (9.9) 


where [cp is the moment of inertia of 6 about the axis CD. 


Example 9.8 Moment of inertia of a hoop 


Find the moment of inertia of a uniform hoop of mass M and radius a about its axis 
of rotational symmetry. 


Solution 


This is the easiest case to treat since each particle of the hoop has perpendicular 
distance a from the specified axis. The required moment of inertia is therefore 


N N 
 — Yomi a= (2m) a= Ma?, 
i=l 


i=1 


where M is the mass of the whole hoop. ™ 


It is evident that, in order to solve problems that include rotating rigid bodies, we need 
to know their moments of inertia. These can be worked out from the definition (9.8), or 
its counterpart for continuous mass distributions. The Appendix at the end of the book 
contains examples of how to do this and also contains a table of common moments of 
inertia, including those for the uniform rod, hoop, disk and sphere. Most readers will 
find it convenient to remember the moments of inertia in these four cases. 


Example 9.9 Rotational kinetic energy of the Earth 


Estimate the rotational kinetic energy of the Earth, regarded as a rigid uniform sphere 
rotating about a fixed axis through its centre. 


Solution 
From the Appendix, we find that 7, the moment of inertia of a uniform sphere about 
an axis through its centre is given by J = 2M R*/5, where M is the mass of the sphere 
and R its radius. The kinetic energy of the Earth is therefore given by 

T = 51a = 4MR’o”, 
where M is the mass the Earth, R is its radius, and @ is its angular velocity. 


On inserting the values M = 6.0 x 10*4 kg, R = 6400 km and w = 7.3 x 107° 
radians per second, T = 2.6 x 107° J approximately. m 
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4m 
U 
FIGURE 9.8 Two blocks of masses m and 2m 
are connected by a light inextensible string f r 
which passes over a circular pulley of mass ne, 2m 
4m and radius a. U 


Example 9.10 Attwood’s machine 


Two blocks of masses m and 2m are connected by a light inextensible string which 
passes over a uniform circular pulley of radius a and mass 4m. Find the upward 
acceleration of the mass m. 


Solution 


The system is shown in Figure 9.8. We suppose that the string does not slip on the 
pulley and that the pulley is smoothly pivoted about its axis of symmetry. 

Let z be the upward displacement of the mass m (from some reference configura- 
tion) and v (= Z) its upward velocity at time t. Then, since the string is inextensible, 
the mass 2m must have the same displacement and velocity, but measured downwards. 
The angular velocity w of the pulley is determined from the condition that the string 
does not slip. In this case, the velocity of the rim of the pulley and the velocity of 
the string must be the same at each point where they are in contact, that is, aw = v. 
Hence w = v/a. Also, from the table in the Appendix, the moment of inertia of a 
uniform circular disk of mass M and radius a about its axis of symmetry is 5M a’. 
Hence, the total kinetic energy of the system is 


rin 
3 
2 


v\2 
c= imu? + 5 (2m)v? + 5 (5(4m)a) (-) = 


The gravitational potential energy of the system (relative to the reference con- 
figuration) is 


V =mgz — (2m)gz = —mgz. 


We must now dispose of the constraint forces. (i) At the smooth pivot that 
supports the pulley, the reactions are perpendicular to the velocities of the particles 
on which they act. Hence these reactions do no work. (ii) Since there is no slippage 
between the string and the three material bodies of the system, the total work done 
by the string on the bodies must be equal and opposite to the total work done by the 
bodies on the string.* (iii) The internal forces that keep the pulley rigid do no work in 
total. Hence the constraint forces do no work in total. 


* Since this string is massless and inextensible, it can have neither kinetic nor potential energy so that the 
total work done on the string must actually be zero. 
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Energy conservation therefore applies in the form 
5.5.12 
5mv° —mgz=E, 


where E is the total energy. If we now differentiate this equation with respect to ¢ 
(and cancel by mv), we obtain 


which is the equation of motion of the system. Thus the upward acceleration of the 
mass m is ¢/5. (If the pulley were massless, the result would be g/3.) 


Rigid body in general motion 


We now go on to find the kinetic energy of a rigid body that has translational as well as 
rotational motion. The method depends on the following theorem. 


Theorem 9.1 Suppose a general system of particles S has total mass M and that its 
centre of mass G has velocity V. Then the total kinetic energy of S can be written in the 
form 


T=5MV*+T°, (9.10) 


where V =|V| and T@ is the kinetic energy of S in its motion relative to G. 


Proof. By definition, 


N 
1 
To = =D) ami (vi — VV) + (vi —V) 


see - ; (dom ») v- sv: (dm ») +5 (Som) V 


I 
2 

=T —}(MV)-V—35V-(MV)+43M(V-V) 
ig 


as required. 


The term 5M V2 can be regarded as the translational contribution to T. When the 
system S is a rigid body, T© also has a nice physical interpretation. In this case, the 
motion of S relative to G is an angular velocity w about an axis C D passing through G, 
as shown in Figure 9.9. It then follows from equation (9.9) that 7? = 4J¢p w?. This can 
be regarded as the rotational contribution to 7. We therefore have the result: 
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a 


FIGURE 9.9 A rigid body 6 in general 

motion. The centre of mass G has velocity V 

and 6 is also rotating with angular velocity w i 
about an axis through G. iC 


Kinetic energy of a rigid body in general motion 


Let B be a rigid body of mass M and and let G be its centre of mass. Suppose that 
G has velocity V and that the body is also rotating with angular velocity w about an 
axis C D passing through G. Then the kinetic energy of B is given by 


T =4MV’ +1licpo’, (9.11) 
where V = |V| and Jcp is the moment of inertia of 6 about the axis CD. The 


term 5M V? is called the translational kinetic energy and the term slo pw the 
rotational kinetic energy of B. 


Example 9.11 Kinetic energy of a rolling wheel 


Find the kinetic energy of the rolling wheel shown in Figure 2.8. 


Solution 

Assume the wheel to be uniform with mass M and radius b. Then its centre of mass C 
has speed u so that the translational kinetic energy is 5M u>. Because of the rolling 
condition, the angular velocity of the wheel is given by w = u/b so that the rotational 
kinetic energy is 51 (u/b)*, where I = 5M b*. The total kinetic energy of the wheel 
is therefore given by 


T= $Mu? +} (5Mb") (*)= a 


Example 9.12 Cylinder rolling down a plane 


A uniform hollow circular cylinder is rolling down a rough plane inclined at an angle 
a to the horizontal. Find the acceleration of the cylinder. 


Solution 


Suppose that, at time ¢, the cylinder has displacement x down the plane (from some 
reference configuration) and that the centre of mass G of the cylinder has velocity 
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FIGURE 9.10 A hollow circular cylinder rolls 
down a plane inclined at angle a to the 
horizontal. 


Q 


v (= x) down the plane. The angular velocity w of the cylinder is then determined by 
the rolling condition to be w = v/b. The kinetic energy of the cylinder is therefore 


T =1Mv? +20? =1Mv 411 (2) 
where M is the mass of the cylinder, and / is its moment of inertia about its axis of 
symmetry. From the Appendix, we find that / = Mb so that the kinetic energy of 
the cylinder is given by T = Mv?. 

The gravitational potential energy of the cylinder is given by V = —Mgx sina. 

We must now dispose of the constraint forces. The reaction forces that the 
inclined plane exerts on the cylinder act on particles of the cylinder which, because of 
the rolling condition, have zero velocity. These reaction forces therefore do no work. 
Also the internal forces that keep the cylinder rigid do no work in total. Hence the 
constraint forces do no work in total. 

Conservation of energy therefore applies in the form 


Mv? — Mgxsina = E, 


where E is the total energy. If we now differentiate this equation with respect to ¢ 
(and cancel by Mv), we obtain 


dv |, 

— = 5gsina, 

a 

which is the equation of motion of the cylinder. Thus the acceleration of the cylin- 
der down the plane is 5 g sina. (A block sliding down a smooth plane would have 
acceleration g sina.) Hi 


Example 9.13 The sliding ladder 


A uniform ladder of length 2a is supported by a smooth horizontal floor and leans 
against a smooth vertical wall.* The ladder is released from rest in a position making 
an angle of 60° with the downward vertical. Find the energy conservation equation 
for the ladder. 


* Don’t try this at home! 
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FIGURE 9.11 A uniform ladder of mass M and length 2a is supported by a smooth 
horizontal floor and leans against a smooth vertical wall. At time ¢, its centre of mass G 
has (x, z)-coordinates (X, Z) and the ladder makes an angle 6 with the downward 


vertical. 


Solution 


Let 6 be the angle that the ladder makes with the downward vertical after time t. The 
(x, z)-coordinates of the centre of mass G are then given by 


X =asind, Z =acosé, 
and the corresponding velocity components by 
X = (acos6)6, Z = —(asin@)0. 


The angular velocity of the ladder at time ¢ is simply 6 (see Figure 9.11). The kinetic 
energy of the ladder is therefore given by 


T = 3M (X?+¥?) + 316? = 5Ma°6? + 316”, 


where M is the mass of the ladder and / is its moment of inertia about the horizontal 
axis through G. From the Appendix, we find that / = Ma*/3 so that the kinetic 
energy of the ladder is given by T = (2Ma*/3)6?. 

The gravitational potential energy of the ladder is given by V = MgZ = 
Mgacosé. 

We must now dispose of the constraint forces. The reaction forces that the smooth 
floor and wall exert on the ladder are both perpendicular to the particles of the ladder 
on which they act. These reaction forces therefore do no work. Also, the internal 
forces that keep the ladder rigid do no work in total. Hence the constraint forces do 
no work in total. 

Conservation of energy therefore applies in the form 


+Ma’ 6° + Mgacosé = E£, 


where E is the total energy. From the initial conditions 6 = 0 and @ = 2/3 when 
t = 0, it follows that E = 5M ga. The energy conservation equation for the ladder 
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FIGURE 9.12 Two particles P and Q are 
connected by a light inextensible string and 
can move, with the string taut, on the surface 
of a smooth horizontal cylinder. 


is therefore 
. 3 
62 = -2 (1 — 20088). 
4a 


Since the system has only one degree of freedom, this equation is sufficient to deter- 
mine the motion. 

A curious feature of this problem (not proved here) is that the ladder does not 
maintain contact with the wall all the way down, but leaves the wall when 6 becomes 
equal to cos~!(1/3) * 71°. m 


Problems on Chapter 9 


Answers and comments are at the end of the book. 


Harder problems carry a star (). 


Potential energy and stability 


9.1 Figure 9.12 shows two particles P and Q, of masses M and m, that can move on the 
smooth outer surface of a fixed horizontal cylinder. The particles are connected by a light 
inextensible string of length wa/2. Find the equilibrium configuration and show that it is 
unstable. 


9.2 A uniform rod of length 2a has one end smoothly pivoted at a fixed point O. The other 
end is connected to a fixed point A, which is a distance 2a vertically above O, by a light 
elastic spring of natural length a and modulus sm g. The rod moves in a vertical plane through 
O. Show that there are two equilibrium positions for the rod, and determine their stability. 
[The vertically upwards position for the rod would compress the spring to zero length and is 
excluded. ] 


9.3 The internal potential energy function for a diatomic molecule is approximated by the 
Morse potential 


2 
Vir) =Vo (1-2 9") — Vo, 


where 7 is the distance of separation of the two atoms, and Vo, a, b are positive constants. 
Make a sketch of the Morse potential. 
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FIGURE 9.13 Two blocks of masses M and m 
slide on smooth planes inclined at angles w 
and f to the horizontal. The blocks are 
connected by a light inextensible string that 
passes over a light frictionless pulley. 


Suppose the molecule is restricted to vibrational motion in which the centre of mass G of 
the molecule is fixed, and the atoms move on a fixed straight line through G. Show that there 
is a single equilibrium configuration for the molecule and that it is stable. If the atoms each 
have mass m, find the angular frequency of small vibrational oscillations of the molecule. 


9.4% The internal gravitational potential energy of a system of masses is sometimes called 
the self energy of the system. (The reference configuration is taken to be one in which the 
particles are all a great distance from each other.) Show that the self energy of a uniform 
sphere of mass M and radius R is —-3M*G/5R. [Imagine that the sphere is built up by the 
addition of successive thin layers of matter brought in from infinity.] 


Particles only 


9.5 Figure 9.13 shows two blocks of masses M and m that slide on smooth planes inclined 
at angles a and f to the horizontal. The blocks are connected by a light inextensible string 
that passes over a light frictionless pulley. Find the acceleration of the block of mass m up the 
plane, and deduce the tension in the string. 


9.6 Consider the system shown in Figure 9.12 for the special case in which the particles 
P, Q have masses 2m, m respectively. The system is released from rest in a symmetrical 
position with 0, the angle between O P and the upward vertical, equal to 7/4. Find the energy 
conservation equation for the subsequent motion in terms of the coordinate 0. 

* Find the normal reactions of the cylinder on each of the particles. Show that P is first to 
leave the cylinder and that this happens when 0 = 70° approximately. 


Ropes 


9.7 A heavy uniform rope of length 2a is draped symmetrically over a thin smooth horizontal 
peg. The rope is then disturbed slightly and begins to slide off the peg. Find the speed of the 
rope when it finally leaves the peg. 


9.8 A uniform heavy rope of length a is held at rest with its two ends close together and the 
rope hanging symmetrically below. (In this position, the rope has two long vertical segments 
connected by a small curved segment at the bottom.) One of the ends is then released. Find 
the velocity of the free end when it has descended by a distance x. 

Deduce a similar formula for the acceleration of the free end and show that it always 
exceeds g. Find how far the free end has fallen when its acceleration has risen to 5g. 
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Before 


FIGURE 9.14 The circular hoop ro//s down the slope 
from one level to another. 


FIGURE 9.15 The roll of paper moves to the right and 
the free paper is gathered on to the roll. 


9.9 A heavy uniform rope of mass M and length 4a has one end connected to a fixed point 
on a smooth horizontal table by light elastic spring of natural length a and modulus 5M g, 
while the other end hangs down over the edge of the table. When the spring has its natural 
length, the free end of the rope hangs a distance a vertically below the level of the table top. 
The system is released from rest in this position. Show that the free end of the rope executes 
simple harmonic motion, and find its period and amplitude. 


Rigid bodies 


9.10 A circular hoop is rolling with speed v along level ground when it encounters a slope 
leading to more level ground, as shown in Figure 9.14. If the hoop loses altitude h in the 
process, find its final speed. 


9.11 A uniform ball is rolling in a straight line down a rough plane inclined at an angle a to 
the horizontal. Assuming the ball to be in planar motion, find the energy conservation equation 
for the ball. Deduce the acceleration of the ball. 


9.12 A uniform circular cylinder (a yo-yo) has a light inextensible string wrapped around 
it so that it does not slip. The free end of the string is secured to a fixed point and the yo-yo 
descends in a vertical straight line with the straight part of the string also vertical. Explain why 
the string does no work on the yo-yo. Find the energy conservation equation for the yo-yo and 
deduce its acceleration. 
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9.13 Figure 9.15 shows a partially unrolled roll of paper on a horizontal floor. Initially the 
paper on the roll has radius a and the free paper is laid out in a straight line on the floor. The 
roll is then projected horizontally with speed V in such a way that the free paper is gathered 
up on to the roll. Find the speed of the roll when its radius has increased to b. [Neglect the 
bending stiffness of the paper.] Deduce that the radius of the roll when it comes to rest is 


si +1 
a ———} 
4ga 
9.14 A rigid body of general shape has mass M and can rotate freely about a fixed horizontal 
axis. The centre of mass of the body is distance h from the rotation axis, and the moment of 


inertia of the body about the rotation axis is J. Show that the period of small oscillations of 
the body about the downward equilibrium position is 


1 \12 
m(Go) 
Mgh 
Deduce the period of small oscillations of a uniform rod of length 2a, pivoted about a horizon- 
tal axis perpendicular to the rod and distance b from its centre. 


1/3 


9.15 A uniform ball of radius a can roll without slipping on the outside surface of a fixed 
sphere of (outer) radius b and centre O. Initially the ball is at rest at the highest point of the 
sphere when it is slightly disturbed. Find the speed of the centre G of the ball in terms of the 
variable 0, the angle between the line OG and the upward vertical. [Assume planar motion. ] 


9.16 A uniform ball of radius a and centre G can roll without slipping on the inside surface 
of a fixed hollow sphere of (inner) radius b and centre O. The ball undergoes planar motion in 
a vertical plane through O. Find the energy conservation equation for the ball in terms of the 
variable 0, the angle between the line OG and the downward vertical. Deduce the period of 
small oscillations of the ball about the equilibrium position. 


9.17% Figure 9.6 shows a uniform thin rigid plank of length 2b which can roll without slip- 
ping on top of a rough circular log of radius a. The plank is initially in equilibrium, resting 
symmetrically on top of the log, when it is slightly disturbed. Find the period of small oscilla- 
tions of the plank. 


Chapter Ten 
| 


The linear momentum principle 


and linear momentum conservation 


KEY FEATURES 

The key features of this chapter are the linear momentum principle; its equivalent form, 
the centre of mass equation; and conservation of linear momentum. These principles are 
applied to rocket propulsion, collision theory, the two-body problem and two-body scat- 
tering. 


This chapter is essentially based on the linear momentum principle and its con- 
sequences. The linear momentum principle is the second of the three great principles 
of multi-particle mechanics* that apply to every mechanical system without restriction. 
Under appropriate conditions, the linear momentum of a system (or one of its compo- 
nents) is conserved. Important applications include rocket propulsion, collision theory, 
the two-body problem and two-body scattering. 


10.1 LINEAR MOMENTUM 


We begin with the definition of linear momentum for a single particle and for a 
system of particles. 


Definition 10.1 Linear momentum If a particle has mass m and velocity v, then p, its 
linear momentum, is defined to be 


p=mv. (10.1) 
For a multi-particle system S consisting of particles P,, P2,..., Pj, with masses m,,mo2, 
..., My and velocities v1, v2, ..., vy (see Figure 9.1), P, the linear momentum of S, is 


defined to be the vector sum of the linear momenta of the individual particles, that is, 


N 
P=) p; =) imu. (10.2) 
i=1 


* The other two are the energy and angular momentum principles. 
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Newton’s Second Law can be written in terms of linear momentum in the form 


Although this offers no advantage in the mechanics of a single particle, we will find that 
this type of formulation is very useful in multi-particle mechanics. The expression (10.2) 
can be written simply in terms of the motion of G, the centre of mass of S. Since the 
position vector R of G is given by 


Sy MiP; 
ie mi 


where rj is the position vector of the particle P;, it follows that V, the velocity of G is 


R= 


given by 


where M (= )° m;) is the total mass of the system S. Hence 
P=MV. (10.3) 
Thus the /inear momentum of any system is the same as if all its mass were concentrated 


at its centre of mass. 


Although true for all systems, this result is most useful when finding the linear 
momentum of a moving rigid body. Note that the rotational motion of the rigid body 
does not contribute to its linear momentum; this contrasts with the corresponding calcula- 
tion of the kinetic energy of a rigid body (see Chapter 9). 


10.2 THE LINEAR MOMENTUM PRINCIPLE 


We now derive the fundamental result which relates the linear momentum of any 
system to the external forces that act upon it: the linear momentum principle. 
Suppose that the system S is acted upon by the external forces {F;} and internal 
forces {G;;}, as shown in Figure 9.3. Then the equation of motion for the particle P; is 


d ‘ 
mi = Fi + Gi, (10.4) 


where, as in Chapter 9, we take G;; = 0 wheni = j. Then the rate of increase of the 
linear momentum of the system S can be written 


N N 
dP d dv; 
dt dt (Sm) : mi a’ = 
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which, on using the equation of motion (10.4), gives 


4 


fay prt} Gy{= Sor + DG, 


j=l i=l j=l 


a 


N 


pt 
Py a a Y> (Gij + Gj) ; 
j=l 


i= 


II 
= 1 


| 
= 


where the terms of the double sum have been grouped in pairs and those terms known 
to be zero have been omitted. Now the internal forces {G;;} satisfy the Third Law, so 
that G ;; = —G;;. Hence, each term of the double sum in equation (10.5) is zero and we 
obtain 


Linear momentum principle 
dP (10.6) 
=F 
dt 


where F is the total external force acting on S. This is the linear momentum principle. 
This fundamental principle can be expressed as follows: 


Linear momentum principle 


In any motion of a system, the rate of increase of its linear momentum is equal to 
the total external force acting upon it. 


It should be noted that only the external forces appear in the linear momentum prin- 
ciple so that the internal forces need not be known. It is this fact which gives the linear 
momentum principle its power. 


10.3. MOTION OF THE CENTRE OF MASS 


The linear momentum principle can be written in an alternative form called the 
centre of mass equation, which is more useful for some purposes. If we substitute the 
expession (10.3) for P into the linear momentum principle (10.6) we obtain 


Centre of mass equation 


dv (10.7) 
__=—F- 
dt 
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which is called the centre of mass equation. It has the form of an equation of motion 
for a fictitious particle of mass M situated at the centre of mass, which moves under the 
total of the external forces acting on the system S. This important result can be simply 
expressed as follows: 


Motion of the centre of mass 


The centre of mass of any system moves as if it were a particle of mass the total 
mass, and all the external forces acted upon it. 


Example 10.1 Jumping cat 


A cat leaps off a table and lands on the floor. Show that, while the cat is in the air, its 
centre of mass moves on a parabolic path. 


Solution 
While the cat is in the air, the total external force on its body is due to uniform gravity, 
that is, F = —M gk. The centre of mass equation for the cat is therefore 
dV 
M — = —Mgk, 
dt = 


which is precisely the equation of projectile motion for a single particle. The path 
of the centre of mass of the cat is therefore the same as if it were a particle of mass 
M moving freely under uniform gravity. This path is known (see Chapter 4) to be a 
parabola. & 


In previous examples, we have often used the Second Law to find an unknown con- 
straint force acting on a particle, once the motion of a system has been found by other 
means (see, for instance, Example 6.13). The centre of mass equation allows us to do 
the same thing when the unknown constraint force acts on a rigid body. The following 
examples illustrate the method. 


Example 10.2 Cylinder rolling down an inclined plane 


Consider again a hollow cylinder of mass M rolling down a rough inclined plane as 
shown in Figure 9.10. In Example 9.12, energy conservation was used to show that 
the acceleration of the cylinder down the plane is 5 g sina. Deduce the reaction force 
exerted by the plane on the cylinder. 


Solution 

Suppose that the component of the reaction force normal to the plane is N, while 
the component of the reaction force up the plane is F’. (The plane is rough so both 
components are present.) The cylinder is therefore subject to these ‘two’ external 
forces together with uniform gravity. The centre of mass equation for the cylinder 
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(when resolved into components tangential and normal to the plane) is given by 


d 
M— = Mgsina — F, 0 = N — Mgcosa, 


where dvu/dt = 5 g sina. It follows that the required reactions are given by 


F = 5Mgsina, N = Mgcosa. 


Thus, if F and N are restricted by the ‘law of friction’ F/N < jy, then the supposed 
rolling motion of the cylinder cannot take place if tana > 2. 


Example 10.3 Sliding ladder 


Consider again the uniform ladder of length 2a supported by a smooth horizontal 
floor and leaning against a smooth vertical wall, as shown in Figure 9.11. The ladder 
is released from rest with 6, the angle between the ladder and the downward vertical, 
equal to 60°. In Example 9.13, we used energy conservation to show that, in the 
subsequent motion, 6 satisfies the differential equation 


Q2 = a —2cos@), 
a 


provided that the ladder maintains contact with the wall. Deduce that the ladder loses 
contact with the wall when 6 = cos~!(1/3). 


Solution 


Let the normal reactions exerted on the ladder by the smooth floor and wall be N* 
and N respectively. Then the centre of mass equation for the ladder, resolved into 
horizontal and vertical components, is given by 


MX =N", MZ=N* — Mg, 


where (X, Z) are the coordinates of the centre of mass of the ladder (see Figure 9.11). 
Hence 


N’ =MZ+Mg, NW = MX. 


Now, in terms of the angle 0, X = asin@ and Z = acos@. On differentiating twice 
with respect to ¢, we obtain the corresponding acceleration components 


X = —a(sin6) 6* + a(cos6) 6, 


Z= —a(cos 0) — a(sin 0) 6. 
Hence 
WP =i ((cosé) 62 — (sin6) 6 ) 4+ Mg, 


NW = Ma (— (ino) 6 re (cos) 6 ) 
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In order to express these reactions in terms of 6 alone, we need to know 6? and 6 as 
functions of 6. From the previously derived equation of motion, we already have 


= 


: 
= — 2.008 8) 


and, if we differentiate this equation with respect to ¢ (and cancel by 6), we obtain 


ag 
4a 


6= sin. 


On making use of the above expressions for 6? and 6, the required reactions are 
found to be 


_ Msg 
~ 4 


nF MS 


3 
(1—3cos6+9cos?0), NW = sin 0(3 cos @ — 1). 


We observe that the predicted value of N“ becomes zero when @ = cos! (1/3) 
and is negative thereafter. Since negative values of N™ cannot occur (the wall can 
only push), we conclude that the condition that the ladder maintains contact with the 


wall is violated when 9 > cos~!(1 /3). Therefore, the ladder leaves the wall when 
6 =cos!(1/3). 


10.4 CONSERVATION OF LINEAR MOMENTUM 


Suppose that S is an isolated system, meaning that no external force acts on any 
of its particles. Then F’, the total external force acting on S, is obviously zero. The linear 
momentum principle (10.6) for S then takes the form dP/dt = 0, which implies that P 
must remain constant. This simple but important result can be stated as follows: 


Conservation of linear momentum 


In any motion of an isolated system, the total linear momentum is conserved. 


It follows from equation (10.3) that the above result can also be stated in the alter- 
native form ‘Jn any motion of an isolated system, the centre of mass of the system moves 
with constant velocity’. Clearly the same result applies to any system for which the fotal 
external force is zero, whether isolated or not. 

It is also possible for a particular component of P to be conserved while other com- 
ponents are not. Let nm be a constant unit vector and suppose that F -n = 0 at all times. 
Then 

d dP P. dn dP 


“n)= n+ — = —-n=F-n=0. 
dt dt dt dt 


Hence the component P - n is conserved. This result can be stated as follows: 
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Before ejection 


—~r U—U —> UV 


After ejection 


FIGURE 10.1 A rigid body of mass M (the rocket) contains a removable rigid block 
of mass m (the fuel). An internal source of energy causes the fuel block to be ejected 
backwards with speed u relative to the rocket and the rocket is projected forwards. 


If the total force acting on a system has zero component in a fixed direction, then, 
in any motion of the system, the component of the total linear momentum in that 
direction is conserved. 


Conservation of a component of linear momentum 


Conservation of linear momentum is an important property of a system and the sec- 


tions that follow rely heavily upon it. Two examples of momentum conservation are as 


follows: 


10.5 


e The solar system is an example of an isolated system, being extremely remote 


from any other masses. It follows that the total linear momentum of the solar 
system is conserved. Thus the centre of mass of the solar system moves with 
constant velocity. 


On the other hand, a grasshopper trying to move on a perfectly smooth horizontal 
table is not isolated, being subject to gravity and the vertical reaction of the table. 
However, since the grasshopper is not subject to any external horizontal force, it 
follows that, whatever the grasshopper tries to do, his component of total linear 
momentum in any horizontal direction is conserved. His vertical component of 
linear momentum is not conserved; he can leap into the air if he wishes. 


ROCKET MOTION 


An important application of linear momentum conservation is rocket propulsion. 


Figure 10.1 shows a rigid body of mass M (the rocket) which contains a removable rigid 


block of mass m (the fuel). The system is at rest when an internal source of energy causes 


the fuel block to be ejected backwards with speed u relative to the rocket. If the system is 


isolated, then its total linear momentum is conserved, which implies that 


Mv+m(v—u)=0 
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Initially M 


After time 7 m(T) 


mass (—m) dt 


FIGURE 10.2 The rocket and its fuel at times t = 0 and t = t. The element of fuel 
ejected in the time interval [t, t + dt] has mass (—m)dt and (forward) velocity 
v(t) —u. 


where v is the forward velocity of the rocket after the ejection of the fuel. As a result of 
this process the rocket acquires the forward velocity 


Ca 
v= | ——  ]u 
M+m 


This is the basic principle of rocket propulsion. The only mechanically significant dif- 
ference between the simple example above and real rocket propulsion is that, in the case 
of the real rocket, the fuel mass is ejected continuously over a period of time and not in 
a single lump. In practice, the fuel is burned continuously and the combustion products 
eject themselves due to their rapid expansion. 


Rocket motion in free space 


Figure 10.2 shows a more realistic situation. Initially the rocket and its fuel have combined 
mass mg and are moving with constant velocity vo. At time ¢ = 0 the motors are started 
and fuel products are ejected backwards with speed wu relative to the rocket. The fuel 
‘burn’ continues for a time 7,, at the end of which the rocket and unburned fuel have mass 
m,. Letm = m(t) be the mass of the rocket and its unburned fuel after time t. Then m is a 
decreasing function of t and the rate of ejection of mass at time f is —m. Let the system S 
consist of the rocket together with its fuel at time ¢ = 0. After some time t into the burn, 
the mass of S is distributed as shown in Figure 10.2. The rocket and unburned fuel have 
mass m(t) and the remaining mass has been ejected as expended fuel. We will suppose 
that, once an element of fuel is ejected, it continues to move with the velocity it had at 
the instant of ejection.* Since we are assuming S to be an isolated system, its total linear 
momentum is conserved. The initial linear momentum in this one-dimensional problem 
iS movo and the final linear momentum of the rocket and unburned fuel is m(t)v(t), 


* This assumption simplifies our derivation but, as we will see, it is not essential. 
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where v (= v(f)) is the velocity of the rocket at time fr. It remains to take account of 
the linear momentum of the ejected fuel. Consider the element of fuel that was ejected 
in the time interval [t, f + dt]. This has mass (—m(t)) dt and its forward velocity at the 
instant of ejection was v(t) — u. The linear momentum of this fuel element is therefore 
(—m)(v — u) dt and the total linear momentum of the fuel expended in the time interval 


[O, tT] is 
-f m(v — u) dt. 
0 


Linear momentum conservation for the system S therefore requires that 


movg = m(T)v(T) — [ m(v —u) dt, 
0 


which can be written in the form 


[ E (mv) —m(v | dt = 0. 


Since this equality must hold for any choice of t during the burn, it follows that the 


integrand must be zero, that is 


e )— m( )=0 
ae mv —u) = 


for0 <t <T. This simplifies to give 


Rocket equation in free space 


aii (10.8) 
m— = (—m)u 


dt 


the rocket equation, which holds for 0 < ¢ < T. The rocket equation can be interpreted 
physically as the Second Law applied to a system of variable mass* m(t), namely the 
rocket and its unburned fuel. In this interpretation, the term on the right, —mu, plays the 
role of force and is called the thrust supplied by the motors. 


Note. In our derivation, we assumed that, once an element fuel is ejected, it continues to move with 


the velocity it had at the instant of ejection. This is equivalent to assuming that each element of ejected 


* This terminology is undesirable since, in classical mechanics, a ‘system’ means a fixed set of masses (or, 
at the very least, fixed total mass). No standard mechanical principle applies to a ‘system’ whose total 
mass is changing with time. 
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fuel is isolated from other fuel and from the rocket. It clearly makes no difference to the momentum of 
the ejected fuel if momentum is exchanged between elements of itself so that this assumption is actually 
unnecessary. However we must retain the assumption that ejected fuel has no further interaction with the 
rocket. This seems likely to be true in free space, but whether it is true just after take off from solid ground 


is questionable. 


Providing that the ejection speed u is constant, the rocket equation (10.8) can easily be 
solved for any mass ejection rate. On dividing through by m and integrating with respect 
to t, we obtain 


(—m)u dm 
dv= dt =—u = —ulnm-+ constant 
m m 


and, on applying the initial condition v = vp when t = 0, we obtain 


1) =v +uIn dl 


m(t) 


This gives the rocket velocity at time ¢. In particular, at the end of the fuel burn, the rocket 
velocity has increased by 


mo 

Av =v, —v9 =uln( — ]}, (10.9) 
my 

where m, and v; are the final mass and velocity of the rocket. One can make some 

interesting deductions from this solution. 


(i) Av, the increase in the rocket velocity, is directly proportional to u, the fuel ejection 
speed. Thus it pays to make w as large as possible. Chemical processes can produce 
values of u as high as 5000 ms~!. 

(ii) If the fuel were all ejected in a single lump, Av would never exceed the ejection 
speed u. But when the fuel is ejected over a period of time, it is possible for 
the rocket to attain any velocity by making the mass ratio mg/m , large enough. 
For example, if we wish to make Av = 3u, then we need mg/m, = e*? ~& 20. 
This means that 19 kg of fuel would be required for every kilogram of payload. 
The amount of fuel needed to achieve higher velocities quickly makes the process 
impractical. To achieve Av = 10u takes 22 metric tons of fuel for every kilogram 
of payload! 


Rocket motion under gravity 


Suppose now that the rocket is moving vertically under gravity. If we regard the govern- 
ing equation as the equation of motion for the variable mass m(t), then, when gravity is 
introduced, the equation of motion becomes 


Rocket equation including gravity 
As (10.10) 


m— = (—m)u — mg 


dt 
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where v is measured vertically upwards, and the weight force mg means m(t)g. In this 
case, the effective force on the right is the sum of the thrust (—m)u acting upwards and 
the weight force mg acting downwards. 

When the gravity is uniform and the ejection speed u is constant, the new rocket 
equation (10.10) can also be solved easily for any mass ejection rate. On dividing through 
by m and integrating with respect to t, we obtain 


(—m)u 
dv= — g| dt = —ulnm + gt + constant 


m 
and, on applying the initial condition v = vp when t = 0, we obtain 


mo 
v(t) = vg tuln (=") — gt. 


m(t) 


This gives the rocket velocity at time f. In particular, at the end of the fuel burn, the rocket 
velocity has increased by 


mo 
Av= v1 m =uin(2) — eT, (10.11) 
my 


where mj, and v, are the final mass and velocity of the rocket and T is the time taken to 
burn all the fuel. 

It will be noticed that, if T is too large, then Av will be negative, which is hardly 
possible for a rocket standing on the ground. The reason for this paradox is that, if the fuel 
is burned too slowly then the thrust will be less than the initial weight of the rocket, which 
will not take off until its weight has become less than the thrust. We will therefore assume 
that (—m)u > mg at all times during the burn so that the rocket has positive upward 
acceleration and achieves its maximum speed when ¢t = T. If the rocket starts from rest, 
it then follows that the maximum speed achieved is 


mo 
Umax = ulin (~) — eT. 
m\ 


In this and the zero gravity case, the distance travelled during the burn depends on the 
functional form of m(t). 


10.6 COLLISION THEORY 


Another important application of linear momentum conservation occurs when we 
have an isolated system of two particles, and one particle is in collision with the other. 


Collision processes 


It is important to understand the meaning of the term ‘collision’. Suppose that the mutual 
interaction between the two particles tends to zero as the distance between them tends to 
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Before After 


e-— Uu @ 


FIGURE 10.3 A collision between two particles viewed from the laboratory frame. 
A particle of mass m, and initial velocity wu collides with a ‘target’ particle of mass 
mz, which is initially at rest. After the collision, the particles have velocities wu, and 
uz respectively. 0; is the scattering angle of the mass 771, 42 is the recoil angle of 
the mass 2, and 6 (= 6; + 62) is the opening angle between the emerging paths. 


infinity, so that, if the particles are initially a great distance apart, each must be moving 
with constant velocity. If the particles approach each other, then there follows a period 
during which their mutual interaction causes their straight line motions to be disturbed. If 
the particles finally retreat to a great distance from each other, then they will move with 
constant velocities again, and these final velocities will generally be different to the initial 
velocities. This is what we mean by a collision process. Note that a collision process 
is not restricted to those cases in which the particles make physical contact with each 
other. This can of course happen, as in the ‘real’ collision of two pool balls. However, the 
deflection suffered by an alpha particle in passing close to a nucleus is also a ‘collision’, 
even though the alpha particle and the nucleus never made contact. Collision processes are 
particularly important in nuclear and particle physics, where they are the major source 
of experimental information. 


General collisions 


Consider the collision shown in Figure 10.3. A particle of mass m, and initial velocity u 
is incident upon a ‘target’ particle of mass m2 which is initially at rest.* This is typical 
of the collisions observed in nuclear physics. After the collision we will suppose the 
particles retain their identities (and therefore their masses) and emerge with velocities 1 
and u2 respectively. How are the final and initial motions of the particles related? Clearly 
we cannot ‘solve the problem’ since we have not even said what the mutual interaction 
between the particles is. However, it is surprising how much can be deduced simply from 
conservation laws without any detailed knowledge of the interaction. 

Since the two particles form an isolated system, their total linear momentum is con- 
served, that is, 


* This means ‘at rest in the laboratory reference frame’. 
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myu= muy +m2u2 (10.12) 


This linear relation between the vectors uw, uw, and u2 implies that these three velocities 
must lie in the same plane so that scattering processes are two-dimensional. 
Generally, collisions are not energy preserving. The energy principle for the colli- 

sion has the form 

1 2 1 2 1 2 

xmyu" + Q = smyujy + 5M2U3, 
where u = |u|, uy = |uy|, U2 = |u2|, and Q is the energy gained in the collision. In 
‘real’ collisions between large bodies, energy is usually lost in the form of heat, so that Q 


is negative. However, in nuclear collisions in which the particles change their identities, it 
is perfectly possible for energy to be gained. 


Example 10.4 Making Kraptons 


A little known particle physicist has proposed the existence of a new particle, with 
charge +2 and mass 2, which he has named the Krapton. He has calculated that this 
can be produced by the collision of two protons in the reaction* 


p* +pt+10MevV > K*t 


Having failed to obtain funding to verify his theory, he has built his own equipment 
with which he accelerates protons to an energy of 16 MeV and uses them to bombard 
a Stationary target of hydrogen. Could he succeed in making a Krapton? 


Solution 

Suppose a proton with kinetic energy E collides with proton at rest. Then this sys- 
tem has initial linear momentum (2mE)!/2, where m is the mass of a proton. This 
linear momentum is preserved by the collision so that, if a Krapton of mass 2m were 
produced, it would have linear momentum (2mE)!/? and therefore kinetic energy 
E/2. Hence, only 8 MeV of the initial energy is available for Krapton building and, 
according to the physicist’s own calculation, this is not enough. (On the other hand, a 
head-on collision between two 5 MeV protons would be enough. Why?) 


Elastic collisions 


The linear momentum equation (10.12) holds whether the collision is between pool balls, 
protons or peaches. Much more can be said if the collision is also energy preserving. 


Definition 10.2 Elastic collision A collision between particles is said to be elastic if 
the total kinetic energy of the particles is conserved in the collision. 


* The electron volt (eV) is a unit of energy equal to 1.6 x 10-19 J approximately. 
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Frame invariance In order that the above definition be physically meaningful, it is necessary that a colli- 
sion observed to be elastic in one inertial frame should also be elastic when observed from any other. This 
is not obviously true, since kinetic energy is not a linear quantity. However, since the total kinetic energy 


of the system can be written in the form T = TOM + TG (see Theorem 9.10), where TOM 


is preserved in 
the collision and T© is frame independent, it follows that any gain or loss of kinetic energy in the collision 


is independent of the inertial reference frame used to observe the event. 


Elastic collisions are very common and extremely important. For example, any col- 
lision in which the mutual interaction force is conservative is elastic. In particular, the 
collisions that occur in Rutherford scattering are elastic. In elastic collisions, we have 
energy conservation in the form 


mu? = Amyuy + dm (10.13) 


Nie 


and, together with linear momentum conservation (10.12), we can make some interesting 
deductions. If we take the scalar product of each side of the linear momentum equation 
(10.12) with itself, we obtain 

miu = miuy + 2m mu, + un + mau5, 
and, if we now eliminate the term in u” between this equation and the energy conservation 
equation (10.13), we obtain, after simplification, 


2m uy + Uy = (my — m2) us. (10.14) 


Since uy + U2 = u,u2 cos 8, where 6 is the opening angle between the paths of the emerg- 
ing particles, the formula (10.14) can also be written 


no (10.15) 
2m uy 


provided that u; 0, that is, provided that the incident particle is not brought to rest by 
* This formula holds for all elastic collisions, whatever the nature of the 
particles and the interaction. It therefore applies equally well to pool balls‘ and protons, 


the collision. 


but not peaches. Given the mass ratio of the two particles, formula (10.15) relates the 
speeds of the particles and the opening angle between their paths after the collision. 


* The incident particle can be brought to rest in a head-on collision with a particle of equal mass. 
¥ Collisions between pool balls are very nearly elastic. However, in the present treatment, we are disre- 
garding the rotation of the balls. 
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Example 10.5 Finding the final energies 


A ball of mass m and (kinetic) energy E is in an elastic collision with a second ball 
of mass 4m that is initially at rest. The two balls depart in directions making an angle 
of 120° with each other. What are the final energies of the two balls? 


Solution 


On substituting the given data into the formula (10.15), we find that u;/u2 = 3. It 
follows that 


Ey _ 5muy =1(") 9 
Ey 5(4m)u3 4 


Hence E, = E and £2 = 4 EL. @ 


An important special case occurs when the two particles have equal masses. In this 
case, formula (10.15) shows that the opening angle must always be a right angle. Thus, in 
an elastic collision between particles of equal mass, the particles depart in directions at 
right angles. Note that this result applies only when the target particle is initially at rest. 


Example 10.6 Elastic collision between two electrons 


In an elastic collision between an electron with kinetic energy E and an electron at 
rest, the incoming electron is observed to be deflected through an angle of 30°. What 
are the energies of the two electrons after the collision? 


Solution 

Since the collision is elastic and the electrons have equal mass, the opening angle 
between the emerging paths must be 90°. The target electron must therefore recoil at 
an angle of 60° to the initial direction of the incoming electron. Let the speed of the 
incoming electron be u and speeds of the electrons after the collision be wu; and u2 
respectively. Then conservation of linear momentum implies that 


mu = mu, cos 30° + mu? cos 60°, 


0 = muy sin30° — muz sin 60°, 


which gives uj = 5Vv3 u and u2 = pu. Hence, after the collision, the electrons have 
energies 25 and zE respectively. i 


10.7 COLLISION PROCESSES IN THE ZERO-MOMENTUM 
FRAME 


We have so far supposed that the inertial reference frame from which the scatter- 
ing process is observed is the one occupied by the experimental observer. This is called the 
laboratory frame (or lab frame) since it is the frame in which measurements (of scatter- 
ing angles, for instance) are actually taken. In the lab frame, the target particle is initially 
at rest. 
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FIGURE 10.4 A collision between two particles viewed from the zero-momentum frame. 
The initial momenta p,, p> are equal and opposite, as are the final momenta p',, p',. The 
angle w is the angle through which each of the masses is scattered. 


However, it is very convenient to ‘view’ the scattering process from a different inertial 
frame. Since the two particles form an isolated system, their centre of mass G moves with 
constant velocity and so the frame* in which G is at rest is inertial. In this frame, the 
total linear momentum of the two particles is zero and, for this reason, we call it the 
zero-momentum frame’ or ZM frame. 

Consider, for example, the scattering problem which, in the lab frame, is shown in 
Figure 10.3. Then the total linear momentum P = mj u and the velocity V of the centre 
of mass of the two particles is 


yo (10.16) 
mi +m? 


This therefore is the velocity of the ZM frame relative to the lab frame for this collision 
process. 


Collisions viewed from the ZM frame 


Two-particle collisions look simple when viewed from the ZM frame. This is because, 
since the total linear momentum is now zero, the initial linear momenta p,, p> of the two 
particles and the final momenta p’,, p/, of the two particles must satisfy 


Pi t+ po =9, Pi +p ,=9. (10.17) 


Thus, when a two-particle collision is viewed from the ZM frame, the initial momenta 
are equal and opposite and so are the final momenta. Figure 10.4 shows what a two- 
particle collision looks like when viewed from the ZM frame. Because of the relations 


* This frame has the same velocity as G, and no rotation, relative to the lab frame. 
* The term ‘centre of mass frame’ is also used. However, ‘zero-momentum frame’ is preferable since this 
notion holds good in relativistic mechanics. 
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(10.17), the particles both arrive and depart in opposite directions, so that each particle is 
deflected through that same angle y. All this follows solely from conservation of linear 
momentum. We can say more if we also have an energy principle of the form 


2 2 2 2 
5m i|vi|> + 5ma|vo\- + O = 5mi|v) > + 5mg| v5)’, 


where v1, V2, Wis v5 are the initial and final velocities of the particles (as shown in Figure 
10.4), and Q is the kinetic energy gained as a result of the collision.* Let p be the common 
magnitude of the initial momenta p,, p>, and p’ be the common magnitude of the final 
momenta p',, p’,. Then the energy balance equation can be re-written in the form 


p? p n _ ne a 
2m, 2m 2m, = 2m’ 
that is, 
20mm 
2 =p % (en (10.18) 
m, +m2 


Thus the magnitudes of the initial and final momenta are related through Q, the energy 
gained in the collision. This is depicted in the momentum diagram in Figure 10.4. The 
magnitudes of the initial and final momenta (p and p’) are represented by the radii of the 
two dashed circles. The diagram shows the case in which p’ > p, which corresponds to 
Q > 0. For an elastic collision, the circles are coincident and all four momenta have 
equal magnitudes. 

In a typical scattering problem, the masses m1, mz and the initial momenta p,, p> are 
known. For the scattering problem shown (in the lab frame) in Figure 10.3, v1} = u—V 
and v2 = —V, where V is given by equation (10.16). It follows that the initial momentum 
magnitude in the ZM frame are given by 


mim2u 
p= ——_ (10.19) 
mi +m 
where u = |u|. The scattering process is now entirely determined by the parameters Q 
(the energy gain) and wy (the ZM scattering angle). Given p and Q, p’ is determined 
from equation (10.18). Together with w, this determines the final momenta p, and p>. 
The parameters Q and w depend on the physics of the actual collision. For instance, the 
collision may be known to be elastic, in which case Q = 0. The question of how the 
scattering angle y is related to the actual interaction and initial conditions is addressed in 
section 10.9. 


Returning to the lab frame (elastic collisions only) 


Although the scattering process looks simpler in the ZM frame, we usually need to know 
the details of the scattering actually observed by the experimenter in the lab frame. This 


* As remarked earlier, Q is frame independent and so is the same as the energy gain measured in the lab 
frame. 
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FIGURE 10.5 The final particle velocities u1, 
Uz in the lab frame are obtained from the 
final velocities v}, v5, in the ZM frame by the 
relations uw; = v, + V, uz = v5, + V. The 
diagram shows the elastic case, in which the 
velocity triangle for wz is isosceles. 


entails transforming the properties of the final state (velocities, momenta and kinetic ener- 
gies) from the ZM frame back to the lab frame. Since the ZM frame has velocity V (given 
by (10.16)) relative to the lab frame, the final velocities u;, uz observed in the lab frame 
are related to the final velocities v‘, v, in the ZM frame by 


uj =v, +V, ur =v, +V. (10.20) 


Any other properties can then be found from uw, u2. The transformations (10.20) are 
depicted geometrically in Figure 10.5. The transformation formulae become rather com- 
plicated in the general case, but simplify nicely when the collision is elastic. From now on 
we will restrict ourselves to elastic collisions only. In this case, Q = 0 and the collisions 
are parametrised by w alone. The energy equation (10.18) then implies that p’ = p so 
that the four momentum magnitudes are equal, and given by equation (10.19). The final 
speeds of the particles in the ZM frame are therefore given by 

mu 


= ———; v 
m, +m 


mu 
= —— =/J, (10.21) 
m, +m? 


| 


where V = |V|. We may now deduce the required information from Figure 10.5. The lab 
scattering angle 0; can be expressed in terms of the parameter angle y by 


v; sin sin w sin wy 


t 0 = => — ‘ 
ee v,cosw+V  cosw+(V/v}) cos + (mj/mz) 


on using equations (10.21). The lab recoil angle 62 is easily found since, in an elastic 
collision, the velocity triangle for w2 is isosceles with angles w, 02 and 62, as shown in 
Figure 10.5. It follows that 


0 = 5(a — yp). 
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The expression for the lab opening angle 6 (= 6; + 02) is therefore given by 


tan@ = tan(0; + 42) = = 


tan 0; + tan 62 m, +m 
1+tan6,tand. — 


) cots, 


m,—mpy 


after some simplification. 
To find the final energies, we observe that 


uz = 2V sin(S) 


so that Fo, the final lab energy of the mass 77 is given by 


2 
1 in(2 
E> 5m? (2v sin(3¥)) 4m\m2 . 2/4 
_ ; = sin“(5¥), 


Eo smu (m, +m2)? 


where Eo (= 5mu") is the lab energy of the incident mass m,. Since the collision is 
elastic, the final lab energy of the mass 7m, is simply deduced from the energy conservation 
formula E; + Eo = Ep. 

The above formulae give the properties of the final state following an elastic two- 
particle collision in terms of the ZM scattering angle w. We will call them the elastic 
collision formulae and they are summarised below: 


Elastic collision formulae 


sin vy 1 
A. tan@,; = ————— B. 62 = 5(m -— 
an 6} ees 2=5(7—-) 
Vie ll i E2 ae eal 
C. tan@ = | —— t(5 D. = 
an (27) cots a ie sin” (5) 
(10.22) 


w is the scattering angle in the ZM frame, and y = m;/mz2, the mass ratio of the 
two particles. 


Using the elastic collision formulae A word of advice about the use of these formulae may 
be helpful. Most questions on this topic tell you some property of the scattering in the lab 
frame and ask you to find another property of the scattering in the lab frame; the ZM frame is 
never mentioned. /t is inadvisable to start manipulating the elastic scattering formulae. This 
is almost guaranteed to cause errors. The simplest method is as follows: (i) Use the given data 
to find y by using the appropriate formula ‘backwards’, and then (ii) use this value of y to 
find the required scattering property. In short, the advice is ‘go via i’. 
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Example 10.7 Using the elastic scattering formulae 


In an experiment, particles of mass m and energy E are used to bombard stationary 
target particles of mass 2m. 


Q. The experimenters wish to select particles that, after scattering, have energy E/3. 
At what scattering angle will they find such particles? 


A. If E,/Eo = 1/3, then by energy conservation E2/Eo9 = 2/3. First use formula D 
to find y. Since the mass ratio y = 1/2, this gives 


2 
a= = sin?(}0). 


so that wy = 120°. Now use formula A to find the scattering angle 6,. This gives 
tan@,; = oo so that 6; = 90°. Particles scattered with energy E'/3 will therefore be 
found emerging at right angles to the incident beam. 


Q. In one collision, the opening angle was measured to be 45°. What were the indi- 
vidual scattering and recoil angles? 


A. First use formula C to find y. This gives 
1 1 
cotsw) = 5, 


so that sv = 72°, to the nearest degree. Now use formula B to find the recoil angle 
62. This gives 62 = 90° — 72° = 18°. The scattering angle 6; must therefore be 
0; = 0 — & = 45° — 18° = 27°. 


Q. In another collision, the scattering angle was measured to be 45°. What was the 
recoil angle? 


A. First use formula A to find y. This shows that w satisfies the equation 
2cosw —2sinyw = 1, 

which can be written in the form* 
V8 cos (Wy +45°) = 1. 


This gives y = 24°, to the nearest degree. Formula B now gives the recoil angle 62 
to be 62 = 78°, to the nearest degree. 


THE TWO-BODY PROBLEM 


The problem of determining the motion of two particles, moving solely under 


their mutual interaction, is called the two-body problem. Strictly speaking, all of the orbit 


* Recall that equations of the form acos y + bsinw = c are solved by writing the left side in the ‘polar 
form’ R cos(w — a), where R2 = a2 +b? and tana = b/a. 
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FIGURE 10.6 The motion of P; and P2 relative to their centre of mass (left), and the motion of P 
relative to P> (right). 


problems considered in Chapter 7 should have been treated as two-body problems since 
centres of force are never actually fixed. The one-body theory is a good approximation 
when one particle is much more massive than the other. When the two particles have 
similar masses, the problem must be treated by two-body theory, in which neither particle 
is assumed to be fixed. 

Let P; and P> be two particles moving under their mutual interaction. By the Third 
Law, the forces that they exert on each other are equal in magnitude, opposite in direction, 
and act along the line joining them. We will further suppose that the magnitude of these 
interaction forces depends only on, the distance separating P; from P2. The forces F1, 
Fo, acting on P}, P2, then have the form 


Pyeleyr, F.=-—-F(r)f, 


where r = rj — r2,r = |r| — r2| and? = r/r (see Figure 10.6). The equations of 
motion for P;, Pz are therefore 


mF, = F(r)7, mor, = —F (ry. ae 


This is a generalisation of central force motion in which each particle moves under a force 
centred upon the other particle. Although this problem appears to be complicated, it can 
be quickly reduced to an equivalent one-body problem. 

We first observe that the two particles form an isolated system so that their total lin- 
ear momentum is conserved, or (equivalently) their centre of mass G moves with con- 
stant velocity. The motion of G is therefore determined from the initial conditions and it 
remains to find the motion of each particle relative to G, that is, their motions in the ZM 
frame. It turns out however that it is easier to find the motion of one particle relative to 
the other. The motion of each particle relative to G can then be easily deduced. 


The equation of relative motion 
It follows from the equations of motion (10.23) that 


F(r)r m FOr _ (mi ting 
~ \ mym2 


Fi —Fro= ) Fenn, 


mM m2 
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so that r, the position vector of P; relative to P2, satisfies the equation 


Relative motion equation 


mim. \.. - (10.24) 
—— ]r=Frir)r, 
m,+mo9 
which we call the relative motion equation. 
Definition 10.3 Reduced mass The quantity 1, defined by 
ie (10.25) 
mi +m 


is called the reduced mass. 


Our result can be expressed as follows: 


Two-body problem — the relative motion 


In the two-body problem, the motion of P; relative to P2 is the same as if Po were 
held fixed and P; had the reduced mass ju instead of its actual mass m1. 


This rule* allows us to replace the problem of the motion of P; relative to Py by an 
equivalent one-body problem in which P? is fixed. The solution of such problems is 
fully described in Chapter 7. 


Example 10.8 Escape from a free gravitating body 


Two particles P; and P2, with masses m,; and m, can move freely under their mutual 
gravitation. Initially both particles are at rest and separated by a distance c. With 
what speed must P; be projected so as to escape from P2? 


Solution 

Since this is a mutual gravitation problem, we take our rule in the form: The motion 
of P, relative to P2 is the same as if Pz were held fixed and the constant of gravitation 
G replaced by G', where 


Ge (= | GC. 


mg 


* The rule is ambiguous when the force F also depends on mj, as in mutual gravitation. Do you also 
replace this m, by 2? The answer is no, but the easiest way to avoid this glitch in the mutual gravitation 
problem is to make the transformation G + (my, + m2)G/mz instead. This has the correct effect and is 
not ambiguous. 
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FIGURE 10.7 Particles P; and P2 move under their mutual gravitation. In the zero 
momentum frame, the orbits are similar conics, each with a focus at G (left). The orbit of 
P, relative to P2 is a third similar conic with P2 at a focus (right). 


From the one-body theory in Chapter 7, we know that P; will escape from a fixed P2 
if it has positive energy, that is if 


m,im2G 
1 1/72 = 


5mV" - 0. 


c 


Hence, when P? is not fixed, P; will escape if 


mym2G" 


sm\V" _ > 0, 


Cc 
that is, if 


2(m, +m2)G 
Se 
c 


y2 


This is the required escape condition. & 


Once the relative motion of the particles has been found, one may easily deduce the 
motion of each particle in the ZM frame since (see Figure 10.6) 


m2 my 
ro = ——— _]r, a ——— ]r 
m, +m) m, +m) 


It follows that the orbits of P;, P2 in the ZM frame are geometrically similar to the orbit 

in the relative motion. For instance, suppose that the mutual interaction of P; and P2 is 
gravitational attraction, and that the orbit of P, relative to Pz has been found to be an 
ellipse. Then the orbits of P; and Pz in the ZM frame are similar ellipses, as shown in 
Figure 10.7. The ratio of the major axes of these orbits is m2 : mj, and the sum of their 
major axes is equal to the major axis of the orbit of P; relative to Po. All three orbits have 
the same period t given by 


2 4n*a3 


= ——_., (10.26) 
G (mi +mz2) 
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where a is the semi-major axis of the relative orbit. This formula is simply obtained from 
the one-body period formula (7.26) by replacing G by G’. 

Formula (10.26) shows that, in the approximate treatment in which P2 is regarded 
as fixed, the value of the period is overestimated by the factor 


( a. 
1+ —s. > 
m2 


which is a small correction when m /mz is small. In the Solar system, the largest value 
of m,/mz for a planetary orbit is that for Jupiter, which is about 1/1000. 


Binary stars 


It is probable that over half of the ‘stars’ in our galaxy are not single stars, like the Sun, 
but occur in pairs* that move under their mutual gravitation. Such a pair is called a binary 
star. 

Binary stars are important in astronomy and also provide a nice application of our 
two-body theory. In particular, the two components of the binary must orbit their centre of 
mass on similar ellipses, as shown in Figure 10.7; the orbit of either component relative to 
the other is a third similar ellipse; and the period of all three motions is given by formula 
(10.26), where a is the semi-major axis of the relative orbit. 

One reason why binary stars are important in astronomy is that the masses of their 
component stars can be found by direct measurement; indeed they are the only stars for 
which this can be done. Suppose that the star is an optical binary, which means that both 
components are visible through a suitably large telescope. Then the period of the binary 
can be measured by direct observation. It is also possible to measure the major axis of the 
relative orbit. Once t and a are known, formula (10.26) tells us the sum of the masses of 
the two components of the binary. 


Example 10.9 Sirius A and B 


A typical example of a binary is Sirius in the constellation Canis Major, the brightest 
star in the night sky. The large bright component is called Sirius A and its small dim 
companion Sirius B. The period of their mutual orbital motion is 50 years and the 
value of a is 20 AU. (This is about the distance from the Sun to the planet Uranus.) 
Find the sum of the masses of the two components of Sirius. 


Solution 
In terms of astronomical units, in which G = 477”, formula (10.26) gives 


3 


20 
MavrMg=— 5 =32Mou 


* Groups of three or more also occur. 
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In order to determine the individual masses of the components by optical means, it is 
necessary to find the a values for one of the individual components in its motion relative to 
the centre of mass. The procedure is essentially the same as before, but much more difficult 
observationally since the motion of the chosen component must be measured absolutely, 
that is, relative to background stars. In the case of Sirius, it is found that M4 = 2.1 Mo 
and Mg = 1.1 Mo. 


10.9 TWO-BODY SCATTERING 


An important application of two-body motion is the two-body scattering prob- 
lem. In our treatment of collision theory, we considered the whole class of possible colli- 
sions between two particles that were consistent with momentum and energy conservation. 
These collisions were parametrised by the ZM scattering angle w. We now consider the 
problem in more detail. Given the interaction between the particles and the impact param- 
eter p, what is the resulting ZM scattering angle? This question can be answered by using 
two-body theory. We break up the process into a number of steps: 


1. Find the {p, 6}-relation for the one-body problem 


First consider the one-body problem in which the particle P2 is held fixed, and work 
out (or look up) the relation between the impact parameter p and the scattering angle 0. 
For example, the {p, 0}—relation for Rutherford scattering was derived in Chapter 7 and 
was found to be 
9142 


tanl@ = oe 
my, pu 


2 


(10.27) 


2. Find the {p, ¢}-relation for the relative motion problem 


The next step is to find the relation between the impact parameter p and the scattering 
angle ¢ observed in the relative motion problem. This is easily obtained from the one- 
body formula (10.27) by replacing mj, by yu (the reduced mass) and replacing @ by ¢. This 
gives 


qiqzd + y) 
my pu2 


tan5¢ = (10.28) 


where y (= m/z) is the ratio of the two masses. 
3. Find the {p, w}-relation observed in the ZM frame 


The angle ¢ that appears in the formula (10.28) is the scattering angle in the motion 
of m, relative to m2. However, by an amazing stroke of good fortune, it is actually the 
same angle as the ZM scattering angle y that we used in collision theory.* Hence, the 


* The reason is as follows: The relative motion in the lab frame must be the same as the relative motion in 
the ZM frame. In this frame, the initial relative velocity of P; is equal to (py/m 1) — (p2/mz), which 
has the same direction as p,;. Likewise, the final relative velocity of P; is equal to ( P /m1) —( Ps /m?), 
which has the same direction as P'. Hence the scattering angle in the relative motion is the same as that 
in the ZM frame. 
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{p, w}Helation when two-body scattering is observed from the ZM frame is obtained 
by simply replacing ¢ in formula (10.28) by y, that is, 


agi t+y) 


; (10.29) 
my pu 


1 
tan5v = 


As always, u means the speed of the incident particle observed in the lab frame. 
4. Find 6; and 6, in terms of p from the elastic collision formulae 


Since the {p, y}-relation (10.29) gives the ZM scattering angle wy in terms of p, this 
expression for yw can now be substituted into the elastic scattering formulae (10.22A) and 
(10.22B) to give expressions for the two-body scattering angle 6,, and recoil angle 62 in 
terms of p. For Rutherford scattering, this gives, after some simplification, 


Two-body Rutherford scattering formulae 


4q\q2pE 2pE une) 


tan 0; = tan 0. = ———____ 
4p? E? — (1 — y)qiqs gigi +y) 


where E (= sm \u7) is the energy of the incident particle and y = m1 /mo. 
These formulae simplify further when the particles have equal masses. In this special 
case, y = | and the scattering and recoil angles are given by 


E 
Hep mh = (10.31) 


pE q1q2- 


(As expected, 6; + 62 = 51.) These formulae would apply, for example, to the scattering 
of alpha particles by helium nucei. 


Two-body scattering cross section 
Having found the {p, 6;}-relation for the two-body scattering problem, the two-body 


scattering cross section a! 8 is given, in principle, by the formula 
pap 
07? (1) = - 
sin 0; dQ, 


However, this requires that the {p, 6; }—relation be solved to give p as a function of 6; and 
the resulting algebra is formidable. 

The following method has the advantage that o/ 8 is determined directly from the cor- 
responding one-body scattering cross section. The trick is to introduce the ZM scattering 
angle y. By the chain rule, 


dp dp dw 
= 4 
do, dw do 


3 
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TB can be written 


ra___P 4p __ Ps g dy 
sin 0; d0, snO; \dw do 


_ (sny dw p dp 
7 (=*) (=) ( oo) 
_ (smy dy ZM 

_ (=) (=) on), 


where o™ is defined by 


and so o 


on 


p_dp 
sinyw dw 


Now o7™ (y) is easily obtained from the one-body cross-section o (9) by replacing m, 


ow) = - (10.32) 


by yu and @ by yy. The two-body cross section is then given by 


Two-body scattering cross section 


TB _ (sind \ (dv \ _zu (10.33) 
o oy =(S") (Bo (W) 


In this formula, we have yet to replace y by its expression in terms of 6). To do this, we 
must invert the formula (10.22 A) to obtain w as a function of 6;. Formula (10.22 A) can 
be rearranged in the form 


sin(w — 6;) = y sin@), 


from which we obtain 


wv = 6, +sin!(y sin 61) (10.34) 
and, by differentiation, 
d 6 
ee (10.35) 
do, (1 — y? sin® 6,)!/2 


These expressions for w and dy/d6, in terms of 6; must now be substituted into equation 
(10.33) to obtain the final formula for the two-body scattering cross section o/ 8 (6). 
These operations can be done with computer assistance. 

For example, in Rutherford scattering, we first obtain o“™ (y) by replacing m, by 
i (and 6 by y) in the one-body cross section formula (7.37) obtained in Chapter 7. This 
gives 


Qed 2 
oy) = qndt+y) (= (10.36) 


2,4 41 
4mju aW 
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(q2q2 /16E?) o™® 


4 7y=0 (one-body theory) 
am 


FIGURE 10.8 The Rutherford two-body 
scattering cross section o’ plotted against 
the scattering angle 6; (7/2 < 6; < 2) for 
various values of the mass ratio 

y (=m /mz2). E is the kinetic energy of the 
incident particles. 


A, 


The two-body Rutherford scattering cross section is now obtained by substituting the 
expression (10.36) into the general formula (10.33) and then replacing yw and dw/d6 by 
the expressions (10.34), (10.35). After much manipulation, the answer is found to be 


(10.37) 


re 193 ( 4 +y7)°(y cos + S)? 
16E* \ S(1+y sin? 6; — cos S$)? ] ’ 


where 
1/2 
S= (1 — y* sin? 61) 


and EF (= smu") is the energy of the incident particle. 

Figure 10.8 shows graphs of o/ 8 (6;) in Rutherford scattering for various choices of 
the mass ratio y. In Rutherford’s actual experiment with alpha particles and gold nucleii, 
the value of y was about 0.02 and the error in the scattering cross section caused by using 
the one-body theory was less than 0.1%. However, as the graphs show, larger values of y 
can give rise to a substantial deviation from the one-body theory. 

When the mass ratio y (= m,/mz) is small, the formula (10.37) is approximated by 


Did 
TB _ 4192 1 2 4 
git = 2 +0( )). 
16E? (So ” d 


Thus, when y is small, the leading correction to the one-body approximation is a constant. 


Equal masses 


The whole process of finding o/ simplifies wonderfully when the two particles have 


equal masses. In this case, y = 26,, dw/d0, = 2, and the general formula (10.33) 
becomes 


0! ® (61) = 4cos 6, o~™ (261) (0 <6 < 2/2). 
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For example, in Rutherford scattering where the particles have equal masses, a! 8 has the 
simple form 


2.2 
TR 4195 [ cos 
6;) = —= | —— 0 <6, < 7/2). 
07 (61) = 15 (4) (0 <6 < 2/2) 


This formula would apply, for example, to the scattering of protons by protons. 


10.10 INTEGRABLE MECHANICAL SYSTEMS 


A mechanical system is said to be integrable if its equations of motion are sol- 
uble in the sense that they can be reduced to integrations.* The most important class 
of integrable systems are those that satisfy as many conservation principles as they have 
degrees of freedom. Suppose that a mechanical system S has n degrees of freedom and 
that it satisfies n conservation principles. Then it is certainly true that the 7 conservation 
equations are sufficient to determine the motion of the system, in the sense that no more 
equations are needed. More importantly though, it can be shown" that these equations can 
always be reduced to integrations. The system S is therefore integrable. 

Before we can apply this method to particular systems, there is a kinematical prob- 
lem to be overcome, namely: how does one find the velocities (and angular velocities) 
of the elements? of S when there are two or more generalised coordinates which vary 
simultaneously? The answer is by drawing a velocity diagram for S as described below: 


Drawing a velocity diagram 
e Draw the system in general position and select a set of generalised coordinates. 


e Let the first generalised coordinate vary (with the other coordinates held constant) 
and mark in the velocity of each element. 


e Now let the second generalised coordinate vary (with the other coordinates held 
constant) and, on the same diagram, mark in the velocity of each element. Con- 
tinue in this way through all the generalised coordinates. 


e Then, when all the generalised coordinates are varying simultaneously, the veloc- 
ity of each element of S is the vector sum of the velocities given to that element 
when the coordinates vary individually. 


In the above, ‘velocity’ means ‘velocity and/or angular velocity’. 


* The system is still said to be integrable even when the integrals cannot be evaluated in terms of standard 
functions! 

+ This is Liouville’s theorem on integrable systems (see Problem 14.15) 

+ The elements of S are the particles and/or rigid bodies of which S is made up. One needs to find (i) the 
velocity of each particle, (ii) the velocity of the centre of mass of each rigid body, and (111) the angular 
velocity of each rigid body, in each case in terms of the chosen coordinates and their time derivatives. 
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© Do (a) me 
| ) /) . 


FIGURE 10.9 Constructing a velocity diagram. Figure (a) shows the system and the 
coordinates x and @. Figure (b) shows the velocities generated when x varies with @ held 
constant. Figure (c) shows the velocities generated when @ varies with x held constant. 
Figure (d) is the velocity diagram which is formed by superposing the velocities in 
diagrams (b) and (c). Note that the velocity of P2 is the vector sum of the two contributions 


shown. 


Example 10.10 Drawing a velocity diagram 1 


The system shown in Figure 10.9 consists of two particles P; and Pz connected by 
a light inextensible string of length a. The particle P; is also constrained to move 
along a fixed horizontal rail and the whole system moves in the vertical plane through 
the rail. Take the variables x and 6 shown as generalised coordinates and draw the 
velocity diagram. 


Solution 
The construction of the velocity diagram is shown in Figure 10.9. @ 


Example 10.11 Drawing a velocity diagram 2 


Two rigid rods CD and DE, of lengths 2a and 25, are flexibly jointed at D and can 
move freely on a horizontal table. Choose generalised coordinates and draw a velocity 
diagram for this system. 


Solution 

Let Oxy be a system of Cartesian coordinates in the plane of the table. Let (X, Y) be 
the Cartesian coordinates of the centre of the rod CD, and let 6 and ¢ be the angles 
that the two rods make with positive x-axis. Then X, Y, 0, ¢ are a set of generalised 
coordinates for this system. These coordinates, and the corresponding velocity dia- 
gram are shown in Figure 10.10. There are four contributions to the velocity of the 
centre of the rod DE. Also, each rod has an angular velocity. Hl 


We will now solve the system shown in Figure 10.9 by using conservation principles. 
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FIGURE 10.10 The velocity diagram for a system with four degrees of freedom. The figure on the 
left shows the system and the generalised coordinates X, Y, 0, @. The figure on the right is the 
completed velocity diagram. 


Example 10.12 Solving an integrable system 


Consider the system shown in Figure 10.9 for the case in which P; and Pz have 
masses 3m and m, the rail is smooth, and the system moves under uniform gravity. 
Initially, the system is released from rest with the string making an angle of 2/3 with 
the downward vertical. Use conservation principles to obtain two equations for the 
subsequent motion. 


Solution 

Let i be the unit vector parallel to the rail (in the direction of increasing x). Since 
the rail is smooth, all the external forces on the system are vertical which means that 
F -i = 0. This implies that P + i, the horizontal component of the total linear 
momentum, is conserved. From the velocity diagram, the value of P + i at time ¢ is 
given by 


P -i =3mx +m (x + (a6) cos) = 4mxk + ma cos. 


Also, since the motion is started from rest, P - i = 0 initially. Hence, conservation 
of P - i implies that 


4% + a0 cos6 = 0, (10.38) 


on cancelling by m. This is our first equation for the subsequent motion. 
Since the rail is smooth, the constraint force exerted by the rail does no work and 
the tensions in the inextensible string do no total work. Hence energy is conserved. 


276 Chapter 10 The linear momentum principle 


From the velocity diagram, the kinetic energy of the system at time f is given by* 
T = $Qm)i* + 4m (x? + (ad)? + 2% (a6) cos) 
= sm (4:7 + a’6* + 2ax6 cos 6) : 
The gravitational potential energy of the system at time ¢ is given by 
V =0-—mgacosé. 


Since the system was released from rest with 6 = 60°, the initial value of T is zero, 
while the initial value of V = — sm ga. Hence, conservation of energy implies that 


5m (4:7 + a’6* + 2ax6 cos 6) — mgacos@ = —5mga, 
which simplifies to give 
4x? + a7? + 2ax6 cos6 = ga (2cos6 — 1). (10.39) 


This is our second equation for the subsequent motion. 

Since this system has two degrees of freedom and satisfies two conservation prin- 
ciples, it must be integrable. Hence, the conservation equations (10.38), (10.39) must 
be soluble in the sense described above. ™ 


Question Equation for 6 


Deduce an equation satisfied by @ alone and find the speeds of P; and P2 when the 
string becomes vertical. 


Answer 
From the linear momentum equation (10.38), 


x= — 7a cos 6 


and, if we now eliminate x from the energy equation (10.39), we obtain, after simpli- 
fication, 


1» 4g (2cosd — 1 
p= 8 () (10.40) 
a \4—cos-6 


which is an equation for @ alone. 

It follows from this equation that, when the string becomes vertical (that is, when 
6=0),67 = 4¢/3a. Hence, at this instant, 6 = —(4g/3a)'/? and (from the momen- 
tum conservation equation) x = +(g/12a)'/*. Hence, the speed of P is (ag/12)!/? 
and the speed of P> is |x + a6| = ag /4)!/?. = 


* Suppose a velocity V is the sum of two contributions, v; and v2, so that V = vy + v2. Then 
|V|? =V-V = (vy +02) + (vy + 02) =v +] +2 + V7 +20 + v2 = |v1|> + |vgI7 + 201 + VD. 


This formula was used to find the kinetic energy of particle Po. 
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Question Period of oscillation 


Find the period of oscillation of the system. 


Answer 

From the equation (10.40), it follows that the motion is restricted to those values 
of @ that make the right side positive, and that 6 = 0 when the right side is zero. 
Hence, @ oscillates periodically in the range —7/3 < 6 < 2/3. Consider the first 
half-oscilliation. In this part of the motion, 6 < Oand so @ satisfies the equation 


: 4¢ 2 /9eo8e —1\'" 
0 ad ? 
a 4 — cos? 6 
a first order separable ODE. On separating, we find that t, the period of a full oscil- 
lation, is given by 


a\'/2 2/3 (4 — cos? 6 Me a\ 1/2 
tT=[{— / —____—_. + 6.23 | — : 
g _n/3 \2cosé — 1 g 
Thus the determination of 6(t) has been reduced to an integration, and, with 0(t) 


‘known’, the equation(10.38) can be solved to give x(f) as an integral. This confirms 
that the system is integrable. @ 


Appendix A Modelling bodies by particles 


When can a large body, such as a tennis ball, a spacecraft, or the Earth, be modelled by a 
particle? 


The answer commonly given is that ‘a body may be modelled by a particle if its size is 
small compared with the extent of its motion’. For example, since the radius of the Earth is 
small compared with the radius of its solar orbit, it is argued that the Earth may be modelled 
by a particle, at least in respect of its translational motion. This argument sounds reasonable 
enough, but it is derived only from intuition and, although it often gives the correct answer, it 
is not the correct condition at all! 

We can make some more definite statements on this quite tricky question by using the 
centre of mass equation. This states that ‘the centre of mass of any system moves as if it were 
a particle of mass the total mass, and all the external forces acted upon it’. It might appear 
that this principle enables us to predict the motion of the centre of mass of any system, but 
this is not so. The reason is that, in general, the total external force acting on a system does 
not depend solely on the motion of its centre of mass; it may depend on the positions of the 
individual particles and also other factors such as the particle velocities. Suppose, for example, 
that the system is a rigid body of general shape moving under the gravitational attraction of a 
fixed mass. Then the total gravitational force acting on the body is only approximately given 
by supposing all the mass to be concentrated at the centre of mass G. The exact force depends 
on the orientation of the body as well as the position of G. The centre of mass equation tells us 
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nothing about this orientation and so the total force on the body is not known and the motion 
of G cannot be determined. 


There are however some important exceptions: 


© Consider a rigid body moving without rotation. In this case the motion of G deter- 
mines the motion of every particle of the body. Then the total external force on the 
body is known and the motion of G can be determined. This, in turn, determines 
the motion of the whole body. For example, the problem of a block sliding without 
rotation on a table can be completely solved by particle mechanics. 


e Consider any system moving solely under uniform gravity. In this case, the total 
external force on the system is a known constant and the motion of G can be deter- 
mined. This does not however determine the motion of the individual particles. For 
example, if the system were a brick thrown through the air, then particle mechanics 
can calculate exactly where its centre of mass will go, but not which particle of the 
brick will hit the ground first. 


In the general case however, we must use approximations. For example, suppose that the 
particles of the system move in the force fields F'; (1) so that the total force on the system is 


N 
> Fie. 


i=1 
In general, this is not equal to )> F;(R). We can however approximate )~ Fi (ri) by )- Fi(R), 
in which case we are assuming that the ratio 
|Fi (ri) — Fi(R)| 
|F;(R)| 


<1 (10.41) 


for all 7. In the following argument we investigate when this condition can be expected to be 
hold. 


Let 5; be the position vector of the particle P; of the system relative to G. Then 


) [5;+ 0 (18:7). 


where d F; /ds means the directional derivative of F; in the direction of the displacement 6;. The condition 
(10.41) therefore requires that 

; ) 1d; | 

S lr=R 


|F;(R)| 


dF; 
F(R +5)) — F(R) = ( ds 


r= 


for all i and for all values of R that are attained in the motion of the system. This will hold if 
FR 

( dF; 
max 


ds a) 


for all i and for all points on the path of the centre of mass. Here ‘max’ means the maximum over all 
directions, and A is the ‘radius’ of the system (the maximum distance of any particle of the system from the 
centre of mass). Thus the radius of the system is required to be small compared with the quantities above, 
not the lateral extent of the motion. 


A < (10.42) 
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Although the condition (10.42) looks formidable, its physical meaning is quite simple: the 
radius of the system is required to be small compared with a length scale over which any of 
the force fields vary significantly. 

Consider for example a body moving under the gravitational attraction of a mass Mg which 
is fixed at the origin O. In this field, the particle P; of the system moves under the force field 


m;MoG ~ 


2 rj. 
ri 


Fi(r) = - 


For this field, the right side of the condition (10.42) evaluates to give R/2, where R (= | R|) is 
the distance of the centre of mass of the body from O. Therefore the total gravitational force 
on the body will be accurately approximated by the force 


MMG ~ 
R 
R2 


FRy= = 


(where M is the total mass of the body), if A < R at each point on the path of the centre 
of mass. This means that the radius of the body must be small compared with its distance of 
closest approach to the centre O. This condition has no direct connection with the ‘extent of 
the motion’. Indeed, on a hyperbolic orbit, the path is infinite, but the condition (10.42) will 
not hold if the path passes too close to the centre of force. Similar remarks apply to motion of a 
body in any central field governed by a power law. If however the field were that corresponding 
to the Yukawa potential 
er /a 


V=-k , 
r 


where k, a are positive constants, then A is required to be small compared with the length 
scale a as well as the distance of closest approach to O. 


Problems on Chapter 10 


Answers and comments are at the end of the book. 


Harder problems carry a star (*). 


Linear momentum principle & centre of mass equation 


10.1 Show that, if a system moves from one state of rest to another over a certain time interval, 
then the average of the total external force over this time interval must be zero. 

An hourglass of mass M stands on a fixed platform which also measures the apparent 
weight of the hourglass. The sand is at rest in the upper chamber when, at time ¢ = 0, a 
tiny disturbance causes the sand to start running through. The sand comes to rest in the lower 
chamber after a time t = t. Find the time average of the appararent weight of the hourglass 
over the time interval [0, t]. [The apparent weight of the hourglass is however not constant 
in time. One can advance an argument that, when the sand is steadily running through, the 
apparent weight of the hourglass exceeds the real weight! ] 


10.2 Show that, if a system moves periodically, then the average of the total external force 
over a period of the motion must be zero. 
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A juggler juggles four balls of masses M, 2M,3M and 4M in a periodic manner. Find 
the time average (over a period) of the total force he applies to the balls. The juggler wishes 
to cross a shaky bridge that cannot support the combined weight of the juggler and his balls. 
Would it help if he juggles his balls while he crosses? 


10.3% A boat of mass M is at rest in still water and a man of mass m is sitting at the bow. The 
man stands up, walks to the stern of the boat and then sits down again. If the water offers a 
resistance to the motion of the boat proportional to the velocity of the boat, show that the boat 
will eventually come to rest at its orginal position. [This remarkable result is independent of 
the resistance constant and the details of the man’s motion.] 


10.4 A uniform rope of mass M and length a is held at rest with its two ends close together 
and the rope hanging symmetrically below. (In this position, the rope has two long vertical 
segments connected by a small curved segment at the bottom.) One of the ends is then released. 
It can be shown by energy conservation (see Problem 9.8) that the velocity of the free end when 
it has descended by a distance x is given by 


> (““*) 
vu =| ———]¢ 
a-x 


Find the reaction R exerted by the support at the fixed end when the free end has descended a 
distance x. The support will collapse if R exceeds 3M g. Find how far the free end will fall 
before this happens. 


10.5 A fine uniform chain of mass M and length a is held at rest hanging vertically down- 
wards with its lower end just touching a fixed horizontal table. The chain is then released. 
Show that, while the chain is falling, the force that the chain exerts on the table is always three 
times the weight of chain actually lying on the table. [Assume that, before hitting the table, 
the chain falls freely under gravity. ] 

* When all the chain has landed on the table, the loose end is pulled upwards with the 
constant force iM g. Find the height to which the chain will first rise. [This time, assume that 
the force exerted on the chain by the table is equal to the weight of chain lying on the table.] 


10.6 A uniform ball of mass M and radius a can roll without slipping on the rough outer 
surface of a fixed sphere of radius b and centre O. Initially the ball is at rest at the highest 
point of the sphere when it is slightly disturbed. Find the speed of the centre G of the ball 
in terms of the variable 0, the angle between the line OG and the upward vertical. [Assume 
planar motion.] Show that the ball will leave the sphere when cos 6 = i 

Rocket motion 


10.7 A rocket of initial mass M, of which M — m is fuel, burns its fuel at a constant rate 
in time t and ejects the exhaust gases with constant speed u. The rocket starts from rest and 
moves vertically under uniform gravity. Show that the maximum speed acheived by the rocket 
is uIny and that its height at burnout is 
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where y = M/m. [Assume that the thrust is such that the rocket takes off immediately. ] 


10.8 Saturn V rocket In first stage of the Saturn V rocket, the initial mass was 2.8 x 10° kg, of 
which 2.1 x 10° kg was fuel. The fuel was burned at a constant rate over 150 s and the exhaust 
speed was 2, 600 ms~!. Use the results of the last problem to find the speed and height of the 
Saturn V at first stage burnout. [Take g to be constant at 9.8 ms~ and neglect air resistance. ] 


10.9 Rocket in resisting medium A rocket of initial mass M, of which M — m is fuel, burns 
its fuel at a constant rate k and ejects the exhaust gases with constant speed u. The rocket 
starts from rest and moves through a medium that exerts the resistance force —ekv, where v is 
the forward velocity of the rocket, and € is a small positive constant. Gravity is absent. Find 
the maximum speed V achieved by the rocket. Deduce a two term approximation for V, valid 
when € is small. 


10.10 Two-stage rocket A two-stage rocket has a first stage of initial mass M1, of which 
(1 — n)M, is fuel, a second stage of initial mass M2, of which (1 — 7) M2 is fuel, and an inert 
payload of mass mo. In each stage, the exhaust gases are ejected with the same speed u. The 
rocket is initially at rest in free space. The first stage is fired and, on completion, the first 
stage carcass (of mass 7M 1) is discarded. The second stage is then fired. Find an expression 
for the final speed V of the rocket and deduce that V will be maximised when the mass ratio 
a = M2/(M, + M2) satisfies the equation 


a? + 2Ba — B =0, 


where 8 = mo/(M, + M2). [Messy algebra. ] 
Show that, when £ is small, the optimum value of a is approximatelely 6!/? and the 
maximum velocity reached is approximately 2u Iny, where y = 1/7. 


10.11% A raindrop falls vertically through stationary mist, collecting mass as it falls. The 
raindrop remains spherical and the rate of mass accretion is proportional to its speed and the 
square of its radius. Show that, if the drop starts from rest with a negligible radius, then it has 
constant acceleration g/7. [Tricky ODE.] 


Collisions 


10.12 A body of mass 4m is at rest when it explodes into three fragments of masses 2m, m 
and m. After the explosion the two fragments of mass m are observed to be moving with the 
same speed in directions making 120° with each other. Find the proportion of the total kinetic 
energy carried by each fragment. 


10.13 Show that, in an elastic head-on collision between two spheres, the relative velocity of 
the spheres after impact is the negative of the relative velocity before impact. 

A tube is fixed in the vertical position with its lower end on a horizontal floor. A ball of 
mass M is released from rest at the top of the tube followed closely by a second ball of mass 
m. The first ball bounces off the floor and immediately collides with the second ball coming 
down. Assuming that both collisions are elastic, show that, when m/M is small, the second 
ball will be projected upwards to a height nearly nine times the length of the tube. 
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10.14 Two particles with masses m1, m2 and velocities v;, v2 collide and stick together. Find 
the velocity of this composite particle and show that the loss in kinetic energy due to the 
collision is 


mm? 2 
|v) — v9| 
2(m, +m?) 


10.15 In an elastic collision between a proton moving with speed uw and a helium nucleus at 
rest, the proton was scattered through an angle of 45°. What proportion of its initial energy 
did it lose? What was the recoil angle of the helium nucleus? 


10.16 In an elastic collision between an alpha particle and an unknown nucleus at rest, the 
alpha particle was deflected through a right angle and lost 40% of its energy. Identify the 
mystery nucleus. 


10.17 Some inequalities in elastic collisions Use the elastic scattering formulae to show the 
following inequalities: 


(i) When m, > m2, the scattering angle 0; is restricted to the range 0 < 0; < sin! (m2/m}1). 
(ii) If mm < mz, the opening angle is obtuse, while, if m1 > mz, the opening angle is acute. 


(iii) 


2 
Ei = (= =) E> < 4m m2 
Eo mi +m Eo ~ (m, +m)? 


10.18 Equal masses Show that, when the particles are of equal mass, the elastic scattering 
formulae take the simple form 


A=5W h=5n-5W O= 


where y is the scattering angle in the ZM frame. 

In the scattering of neutrons of energy E by neutrons at rest, in what directions should the 
experimenter look to find neutrons of energy Ee ? What other energies would be observed in 
these directions? 


10.19 Use the elastic scattering formulae to express the energy of the scattered particle as a 
function of the scattering angle, and the energy of the recoiling particle as a function of the 
recoil angle, as follows: 


2 


’ 


E> dy 


E; 1+ y?cos26; +2y cos (1- y? sin? 6, 
7 Eo (+b? 


2 
Eo (y + 1p — 


Make polar plots of E;/Eo as a function of 6; for the case of neutrons scattered by the nuclei 
of hydrogen, deuterium, helium and carbon. 


Two-body problem and two-body scattering 


10.20 Binary star The observed period of the binary star Cygnus X-1 (of which only one 
component is visible) is 5.6 days, and the semi-major axis of the orbit of the visible component 
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is about 0.09 AU. The mass of the visible component is believed to be about 20M. Estimate 
the mass of its dark companion. [Requires the numerical solution of a cubic equation. ] 


10.21 In two-body elastic scattering, show that the angular distribution of the recoiling parti- 
cles is given by 


4cos 6) 07™ (1 — 262), 


where o“™ (yy) is defined by equation (10.32). 
In a Rutherford scattering experiment, alpha particles of energy E were scattered by a 
target of ionised helium. Find the angular distribution of the emerging particles. 


10.22% Consider two-body elastic scattering in which the incident particles have energy Eo. 
Show that the energies of the recoiling particles lie in the interval 0 < E < Emax, where 
Emax = 4y Eo/(1+y)?. Show further that the energies of the recoiling particles are distributed 
over the interval 0 < E < Emax by the frequency distribution 


f(E)= (= )o2M an, 


max 


where o™ is defined by equation (10.32), and 


In the elastic scattering of neutrons of energy Ep by protons at rest, the energies of the recoiling 
protons were found to be uniformly distributed over the interval 0 < E < Eo, the total cross 
section being A. Find the angular distribution of the recoiling protons and the scattering cross 
section of the incident neutrons. 


Integrable systems 


10.23 A particle QO has mass 2m and two other particles P, R, each of mass m, are connected 

to Q by light inextensible strings of length a. The system is free to move on a smooth hori- 

zontal table. Initially P, Q R are at the points (0, a), (0, 0), (0, —a) respectively so that they 

lie in a straight line with the strings taut. Q is then projected in the positive x-direction with 

speed u. Express the conservation of linear momentum and energy for this system in terms of 

the coordinates x (the displacement of Q) and 6 (the angle turned by each of the strings). 
Show that @ satisfies the equation 


j2 uz 1 
~ a2 \2—cos26 


and deduce that P and R will collide after a time 


1 


m/2 L 
“f [2 - cos*e]* dé. 
a) 
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2 —~> ka ° > J=0 
10 20 30 1 O 


FIGURE 10.11 The quantum mechanical solution of the problem in which a uniform beam of 
particles, each with momentum hk, is scattered by an impenetrable sphere of radius a. Left: The 
(dimensionless) total cross section (s7a*)~'S against ka. Right: A polar graph of the (dimensionless) 
scattering cross section (a? /4)~!o (0) against 9 when ka = 30. 


10.24 A uniform rod of length 2a has its lower end in contact with a smooth horizontal table. 
Initially the rod is released from rest in a position making an angle of 60° with the upward 
vertical. Express the conservation of linear momentum and energy for this system in terms 
of the coordinates x (the horizontal displacement of the centre of mass of the rod) and 6 (the 
angle between the rod and the upward vertical). Deduce that the centre of mass of the rod 
moves in a vertical straight line, and that 6 satisifies the equation 


g2 — 38 1—2cosé 
~ a \4—3cos26)° 


Find how long it takes for the rod to hit the table. 


Computer assisted problems 


10.25 Two-body Rutherford scattering Calculate the two-body scattering cross section o! 8 


for Rutherford scattering and obtain the graphs shown in Figure 10.8. Obtain also an approxi- 
mate formula for 0! valid for small y (= m1/mz), and correct to order O(y?). 


10.26 Comparison with quantum scattering A uniform flux of particles is incident upon a 
fixed hard sphere of radius a. The particles that strike the sphere are reflected elastically. Show 
that the differential scattering cross section is o (9) = a?/4 and that the total cross section is 
S =na?. 

The solution of the same problem given by quantum mechanics is 


a |e (21 + 1) jy (ka) P) (cos 6) . Ama? | (21 +1) ju(ka) |? 
aaa: dX, hi (ka) » = aye dX hikkay 


where P/(z) is the Legendre polynomial of degree /, and j;(z), h;(z) are spherical Bessel 
functions order /. (Stay cool: these special functions should be available on your computer 
package.) The parameter k is related to the particle momentum p by the formula p = hk, 
where fi is the modified Planck constant. When ka is large, one would expect the quantum 
mechanical values for o(@) and S to approach the classical values. Calculate the quantum 
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mechanical values numerically for ka up to about 30 (the calculation becomes increasingly 
difficult as ka increases), using about 100 terms of the series. 

The author’s results are shown in Figure 10.11. The quantum mechanical value for o (@) 
does approach the classical value for larger scattering angles, but behaves very erratically for 
small scattering angles. Also, the value of S tends to twice the value expected! Your physics 
lecturer will be pleased to explain these interesting anomalies. 


Chapter Eleven 
SSS SSS SSS eam 


The angular momentum principle 


and angular momentum conservation 


KEY FEATURES 

The key features of this chapter are the angular momentum principle and conservation 
of angular momentum. Together, the linear and angular momentum principles provide the 
governing equations of rigid body motion. 


This chapter is essentially based on the angular momentum principle and its conse- 
quences. The angular momentum principle is the last of the three great principles of multi- 
particle mechanics* that apply to every mechanical system without restriction. Under 
appropriate conditions, the angular momentum of a system (or one of its components) is 
conserved, and we use this conservation principle to solve a variety of problems. 

Together, the linear and angular momentum principles provide the governing equa- 
tions of rigid body motion; the linear momentum principle determines the translational 
motion of the centre of mass, while the angular momentum principle determines the rota- 
tional motion of the body relative to the centre of mass. In this chapter, we restrict our 
attention to the special case of planar rigid body motion. Three-dimensional motion of 
rigid bodies is considered in Chapter 19. 


11.1. THE MOMENT OF A FORCE 


We begin with the definition of the moment of a force about a point, which is a 
vector quantity. The moment of a force about an axis, a scalar quantity, is the component 
along the axis of the corresponding vector moment. 


Definition 11.1 Moment of a force about a point Suppose a force F acts on a parti- 
cle P with position vector r relative to an origin O. Then ko, the moment' of the force 
F about the point O is defined to be 


ko =rxF, (11.1) 


* The other two are the energy and linear momentum principles. 
* Also called torque, especially in the engineering literature. 
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FIGURE 11.1 Left: Geometrical interpretation of the vector moment ko = rx F.. Right: The 
right- and left-handed senses around the ‘axis’ {O, n}. 


a vector quantity. If the system of particles P,, P2,..., Pn, with position vectors rj, ro, 

.., 'y are acted upon by the system of forces F,, F2,..., Fy respectively, then Ko, 
the total moment of the system of forces about O is defined to be the vector sum of the 
moments of the individual forces, that is, 


N 
Kos y rk, (11.2) 

i=l 
Since any fixed point can be taken to be the origin O, there is no loss of generality in the 
above definitions. However, there are occasions on which it is convenient to take moments 
about a general point A whose position vector is a. To find K 4, we simply replace r; in 
the above definitions by the position vector of P; relative to A, namely, r; — a. This gives 


N 


Ka=)_ (ri—4@)xFi. (11.3) 


i=l 


It follows that K 4 and K 9 are simply related by 
K A> K OoO— 4x F, 


where F is the resultant force. Hence, if F is zero, the total moment of the forces { F; } is 
the same about every point. Such a force system is said to be a couple with moment K. 


Geometrical interpretation of vector moment 


The formula (11.1) has a nice geometrical interpretation. Let P be the plane that contains 
the origin and the force F’, as shown in Figure 11.1. Let m be a unit vector normal to P, 
and suppose that F acts in the right-handed (or positive) sense around the ‘axis’ {O, n}. 
(This is the case shown in Figure 11.1.) Then, from the definition (1.4) of the vector 
product, 


Ko =rxF= (Ini |Flsin(gx +a)) n= F(rcosa)n = (F x p)n, 


where F is the magnitude of F and p (= OM) is the perpendicular distance of O from 
the ‘line of action’ of F. Thus, K 9 has magnitude F’ x p and points in the n-direction. If 
F has the left-handed (or negative) sense around {O, n}, then Ko = —(F x p)n. 
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FIGURE 11.2 The moment of the force F 
about the axis {O,n} is p x (F -@). 


Motion in a plane 


Suppose we have a system of particles that lie in a plane, and the forces acting on the 
particles also lie in this plane. Such a system is said to be two-dimensional. Then the 
total moment K ¢ of these forces about a point O of the plane is given by 


N N 
Ko =)  +Fi pin= (>: “Finn 


i=l i=l 


where the plus (or minus) sign is taken when the sense of F; around the axis {O, n} 
is right- (or left-) handed. This formula explains why, in two-dimensional mechanics, 
the moment of a force can be represented by the scalar quantity + F x p. In the two- 
dimensional case, the directions of all the moments are parallel, so that they add like 
scalars. However, in three dimensional mechanics, the moments have general directions 
and must be summed as vectors. 


Moments about an axis 


Definition 11.2 Moment of a force about an axis The component of the moment K 0 
in the direction of a unit vector n is called the moment of F about the axis* {O, n}; it is 
the scalar quantity Ko +n. 


This axial moment can be written (see Figure 11.2) 


Ko -n=(rxF)-n=(nxr)+ F =((rsina) $) + F 
— p (F ° ¢) ‘ 
where p is the distance of P from the axis {O, n} and ¢ is measured around the axis. The 


direction of the unit vector @ is called the azimuthal direction around the axis {O, n}. 
Thus F - ¢ is the azimuthal component of F. 


* This ‘axis’ is merely a directed line in space. It does not neccessarily correspond to the rotation of any 
rigid body. 
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11.2 


Example 11.1 Finding moments (numerical example) 


A force F = 2i — j — 2k acts on a particle located at the point P(0, 3, —1). Find 
the moment of F about the origin O and about the point A(—2, 4, —3). Find also the 
moment of F about the axis through O in the direction of the vector 3i — 4k. 


Solution 
The moment K g is given by 


Ko =rxF=@83j-k)xQi-j—2k =-7i-2j—6k. 
Similarly, 
Ka=(r-—a)xF=(Qi-j+2k)xQi-j—-2k)=4i14+ 87. 


The required axial moment is K g - n, where n is the unit vector in the direction 
of 3i — 4k, namely 
3i—4k 3i—4k 
r= = 
|3i-—4k| 5 


Hence 


3i-4k\ 3 
Koen =i =37 6k -(~ )- 


=—.m 
5 5 
Example 11.2 Total moment of gravity forces 


A system S moves under uniform gravity. Show that the total moment of the gravity 
forces about any point is the same as if all the mass of S were concentrated at its 
centre of mass. 


Solution 
Without losing generality, let the point about which moments are taken be the origin 
O. Under uniform gravity, F; = —mj; gk, where the unit vector k points vertically 


upwards, so that 


N N 
Ko =) \rix(—migh) = (dyn) x (—gk) = (MR) x (—gk) 
i=l i=l 
= Rx(—Mgk), 


where M is the total mass of S and R is the position vector of its centre of mass. This 
is the required result. Note that it is only true for uniform gravity. 


ANGULAR MOMENTUM 


We begin with the definition of the angular momentum of a particle about a fixed 


point. The old name for angular momentum is ‘moment of momentum’ and that is exactly 
what it is - the moment of the linear momentum of the particle about the chosen point. 
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Definition 11.3 Angular momentum about a point Suppose a particle P of mass m 
has position vector r and velocity v. Then Ig, the angular momentum of P about O is 
defined to be 


lo =rx(mv), (11.4) 
a vector quantity. If the system of particles P,, P2,..., Py, with masses m,,m2,...,™MN, 
have position vectors r\, r2,..., rN and velocities v1, v2,..., UN respectively, then Lo, 


the angular momentum of the system about O, is defined to be the vector sum of the 
angular momenta of the individual particles, that is, 


N 


Lo = Yori x mjuj). (11.5) 


i=1 


The corresponding formula for angular momentum about a general point A is therefore 


N 
La= > (rj — a) x (mj;0;), 


i=1 
from which it follows that L 4 and Lo are simply related by 
L A= Lo —axP, 


where P is the total /inear momentum of the system. 

The geometrical interpretation of the angular momentum of a particle is similar to that 
of moment of a force (see Figure 11.1). Let P be the plane that contains O, P and the 
velocity v, and let m be a unit vector normal to P. Then 


Lo =+(mv x p)n, 


where v is the magnitude of v and p is the perpendicular distance of O from the line 
through P parallel to v. The + sign is decided by the sense of v around the axis {O, n}, 
as shown in Figure 11.1. 


Example 11.3 Calculating the angular momentum of a particle 


The position of a particle P of mass m at time t is given by x = a6?, y = 2a0, 
z = 0, where 6 = O(t). Find the angular momentum of P about the point B(a, 0, 0) 
at time f. 


Solution 
The position vector of the particle relative to B at time ¢ is 


r—b = (a67i + 2a0j) ai =a[ 6? - Hi +26/] 
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FIGURE 11.3 The particle P slides on the 
inside surface of the axially symmetric bowl 


z= f(p). 


and the velocity of the particle at time ¢ is 


‘ dr ar : dé 2a(6i + j)6 
=— = — — =2a(0l ; 
dt do dt 2 


The angular momentum of the particle about B at time ¢ is therefore 


Lp = (r — b)x (mv) = 2mab [@? ire 26 j| x [Oi + J] 


~2ma’ (6? + 1)60k. 


Angular momentum about an axis 

Definition 11.4 Angular momentum about an axis The component of the angular 
momentum Lo in the direction of a unit vector n is called the angular momentum of 
P about the axis {O, n}; it is the scalar quantity Lo «Nn. 


By that same argument as was used for moments about an axis, the angular momentum of 
a particle of mass m and velocity v about the axis {O, n} can be written in the form 


Lo-n=mp(v-$), (11.6) 


where p is the perpendicular distance of the particle from the axis and v o is the azimuthal 
component of v around the axis. 


Example 11.4 Particle sliding inside a bowl 


A particle P of mass m slides on the inside surface of an axially symmetric bowl. 
Find its angular momentum about its axis of symmetry in terms of the coordinates p, 
@ shown in Figure 11.3. 


Solution 

In order to express Lg - k in terms of coordinates, we draw a velocity diagram for 
the system as explained in section 10.10. The velocities vp and vg, corresponding to 
the coordinates p and ¢, have the directions shown in Figure 11.3. These two veloci- 
ties are perpendicular, with the vg contribution in the azimuthal direction around the 
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FIGURE 11.4 The rigid body B6 rotates about 
the fixed axis {O, n} with angular velocity w. \ 


vertical axis {O, k}. It follows that v - ¢ = Up = pd. The required axial angular 
momentum is therefore 


Lo +k = mp(v+$) = mp(pd) = mp’. 


Just for the record, the velocity v, is the (vector) sum of 6 radially outwards and 
z vertically upwards. Note that Zz is not an independent quantity. If the equation of 
the bowl is z = f(p), then Zz = f’(p)/. In particular then, the kinetic energy of P is 
given by 


T = ym| 6? + (f')A)’ + (04)']. 


and its potential energy by V = mgf(p). @ 


11.3. ANGULAR MOMENTUM OF A RIGID BODY 


The probem of finding the angular momentum of a moving rigid body in the 
general three-dimensional case is tricky and is deferred until Chapter 19. In the present 
chapter we essentially restrict ourselves to the case of planar rigid body motion, for which 
it is sufficient to find the angular momentum of the body about its axis of rotation. This 
axis may be fixed (as in the armature of a motor) or, more generally, may be the instan- 
taneous rotation axis through the centre of mass of the body (as in the case of a rolling 
penny). In this section we consider only the case of rotation about a fixed axis; the case of 
planar motion is treated in section 11.6. 

Consider a rigid body 6 rotating with angular velocity w about the fixed axis {O, n}, 
as shown in Figure 11.4. Then the angular momentum of the body about this axis is 


N 


N 
Lo-n= (3); n= Su» = > mpi(v; +9). 


i=l i=1 


where p; is the perpendicular distance of m; from the axis, and 0; - @ is the azimuthal 
component of v; around the axis (see formula (11.6)). But, since the body is rigid, the 
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velocity of m; is entirely azimuthal and is equal to wp;. Hence 


N 
Lo-n= (Smt) o=se (11.7) 


i=l 


where J is the moment of inertia of B about the rotation axis {O, n}. We have thus proved 
that: 


Angular momentum of a rigid body about its rotation axis 


If a rigid body is rotating with angular velocity w about the fixed axis {A, n}, then 
the angular momentum of the body about this axis is given by 


ILA ci =I @, (11.8) 


where J is the moment of inertia of the body about the axis {O, n}. 


It should be remembered that, if a rigid body of general shape is rotating about the 
fixed axis {O, n}, then Lo, the angular momentum of the body about O, is not generally 
parallel to the rotation axis. If the rotation axis happens to be an axis of rotational sym- 
metry of the body, then Lo will be parallel to the rotation axis and Lo is simply given 


by 
Lo = Uo)n. (11.9) 
Example 11.5 Axial angular momentum of a hollow sphere 


A hollow sphere of inner radius a and outer radius b is made of material of uniform 
density p. The sphere is spinning with angular velocity Q about a fixed axis through 
its centre. Find the angular momentum of the sphere about its rotation axis. 


Solution 

From equation (SysA:L=Iomega), the angular momentum of the sphere about its rota- 
tion axis is given by L = I w, where J is its moment of inertia and @ is its angular 
velocity about this axis. In the present case, 


i e— 2Mb" _ ma’, 
where M = 4pb3/3 and m = 4pa?/3, giving 
T= ae (0° <a" ) . 
The angular momentum of the sphere about its rotation axis is therefore 


b= (65-05) 0. 
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11.4 THE ANGULAR MOMENTUM PRINCIPLE 


We now derive the fundamental result which relates the angular momentum of 

any system to the external forces that act upon it — the angular momentum principle. 
Consider the general multi-particle system S which consists of particles P;, P2, ..., 
Py, with masses 1, m2, ..., my and velocities vj, v2, ..., vy, as shown in Figure 9.1. 
Suppose that S is acted upon by external forces F;; and internal forces G;;, as shown in 


Figure 9.3. Then the equation of motion for the particle P; is 
mj— =F; + Gi, (11.10) 


where, as in Chapter 9, we take G;; = 0 wheni = j. Then the rate of increase of the 
angular momentum of the system S about the origin O can be written 


N N 
dL d dv; ; 
— = (3 rina) = ) {rex (mi S22) 4 Fx oma} 


i=l i=l 
N 
dv; 
=) x 1; —— fs 
é dt 
i=l 


since rj X (mjv;) = m;v; X vj = 0. On using the equation of motion (11.10), we obtain 


dio | WN N N N oN 
7a Dts Fi+ >) Gij a Exe, + yy ree 
i=l j=l i=1 i=1 j=1 
N fi-1 
=Ko + Do |) (rixGy+rjxGy) |, (11.11) 
i=2 \j=1 


where K ¢ is the total moment about O of the external forces. We have also grouped 
the terms of the double sum in pairs and omitted those terms known to be zero. Now 
the internal forces {G;;} satisfy the Third Law, which means that G;; must be equal and 
opposite to G;;, and that G;; must be parallel to the line P; P;. It follows that 


rpxGjj+rjxGjj=rj)xGij—rjxGij = (ri —rj)xGij =9, 


since G;; is parallel to the vector r; — r;. Thus each pair of terms of the double sum in 
equation (11.11) is zero and we obtain 


ais 
= K 
Ai 0; 


which is the angular momentum principle. Since any fixed point can be taken to be the 
origin, this proves that: 
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Angular momentum principle about fixed points 


dLs (11.12) 


ae 


for any fixed point A. This fundamental principle can be stated as follows: 


Angular momentum principle about a fixed point 


In any motion of a system S, the rate of increase of the angular momentum of S 
about any fixed point is equal to the total moment about that point of the external 
forces acting on S. 


It should be noted that only the external forces appear in the angular momentum prin- 
ciple so that the internal forces need not be known. It is this fact which gives the princi- 
ple its power. 


Question Overusing the angular momentum principle 


The angular momentum principle can be applied about any point. Are all the resulting 
equations independent of each other? 


Answer 


The short answer is obviously no. The long answer is as follows: From the definitions 
of K and L, we have already shown that 


K,a= Ko -—axF, 
and 
La=Lo -—axP, 


where F is the total force acting on the system S, and P is its linear momentum. It 
follows that, for any fixed point A, 


Ka La=(Ko io) ax(F-P). 


Hence, if the linear momentum principle P = F andthe angular momentum principle 
Lo = Kg have already been used, then nothing new is obtained by applying the 
angular momentum principle about another point A. 


Angular momentum principle about the centre of mass 


The angular momentum principle in the form (11.12) does not generally apply if A is 
a moving point. However, the standard form does apply when moments and angular 
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momenta are taken about the centre of mass G, even though G may be accelerating. This 
follows from the theorem below. The corresponding result for kinetic energy appeared in 
Chapter 9. 


Theorem 11.1 Suppose a general system of particles S has total mass M and that its 
centre of mass G has position vector R and velocity V. Then the angular momentum of S 
about O can be written in the form 


Lo =Rx(MV)+Le, (11.13) 


where Lg is the angular momentum of S about G in its motion relative to G. 
Proof. By definition, 


N 


Lo= Yomi (r; — R)x(v; — V) 
i=l 


N N N 
= So mari xv; — (dy) xV— Rx (dymm) + ( 
i=1 i=l i=l 


= Lo —(MR)xV — Rx(MV)+ M(RxV) 
= Lo — Rx(MV), 


N 
m RxV 
=f 


l 


as required. i 


The two terms on the right of equation (11.13) have a nice physical interpretation. 
The term R x (MV) is the translational contribution to Lg while the term Lg is the 
contribution from the motion of S relative to G. If S is a rigid body, then the motion of 
S relative to G is an angular velocity about some axis through G, and the term Lg then 
represents the rotational contribution to Lo. 

The angular momentum principle for S about O can therefore be written 


hoe “Giese 
= — x —_. 
0 dt dt 
~ aL 
=MRxV+—£. 
dt 
Furthermore, since K9 = Kg + Rx F, it follows that 
dLg 
Kg =" +Rx(MVv -F) 
_ dLg 
dt’ 


on using the /inear momentum principle. We therefore obtain: 


Angular momentum principle about G 
die (11.14) 


dt 
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Thus the standard form of the angular momentum principle applies to the motion of S 
relative to the centre of mass G. 


The rigid body equations 


The linear and angular momentum principles provide sufficient equations to determine 
the motion of a single rigid body moving under known forces. The standard form of the 
rigid body equations is 


Rigid body equations 


—__ — fF Sie Ke 
dt dt 


in which we have taken both the linear and angular momentum principles in their centre of 
mass form. The linear momentum principle thus determines the translational motion of 
G (as if it were a particle), and the angular momentum principle determines the rotational 
motion of the body about G. 

We will use a subset of these equations later in this chapter to solve problems of 
planar rigid body motion. The delights of general three-dimensional rigid body motion* 
are revealed in Chapter 19. 


Example 11.6 Rigid body moving under uniform gravity 


A rigid body is moving in any manner under uniform gravity. Show that its motion 
relative to its centre of mass is the same as if gravity were absent. 


Solution 
Under uniform gravity, the total moment of the gravity forces about any point is the 
same as if they all acted at G, the centre of mass of the body (see Example 11.2). It 
follows that Kg = 0. 

The rigid body equations (11.15) therefore take the form 


dV dLg 

M— =—Megk, — =0 

dt Bi dt 

Hence, when a rigid body moves under uniform gravity, G undergoes projectile 
motion (which we already knew), and the equation for the motion of the body rel- 
ative to G is the same as if the body were moving in free space. 


* The difficulty in the three-dimensional case is the calulation of L. 
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11.5 CONSERVATION OF ANGULAR MOMENTUM 


Isolated systems 


Suppose that S is an isolated system, and let A be any fixed point. Then Ly, the 
total moment about A of the external forces acting on S, is obviously zero. The angu- 
lar momentum principle (11.12) then implies that dL 4/dt = 0, which implies that L 4 
remains constant. The same argument holds for Lg. This simple but important result can 
be stated as follows: 


Conservation of angular momentum about a point 


In any motion of an isolated system, the angular momentum of the system about any 
fixed point is conserved. The angular momentum of the system about its centre of 
mass is also conserved. 


For example, the angular momentum of the solar system about any fixed point (or 
about its centre of mass) is conserved. The same is true for an astronaut floating freely 
in space (irrespective of how he moves his body). The angular momentum of a system 
about its centre of mass may still be conserved even when external forces are present. For 
any system moving under uniform gravity (a falling cat trying to land on its feet, say) 
Kg = 0 which implies that Lg is conserved. 


Angular momentum in central field orbits 


In the case of a particle P moving in a central field with centre O, 
Ko =rxF=0, 


since r and F are parallel. This implies that Lo is conserved. (Angular momentum about 
other points is not conserved.) By symmetry, each possible motion of P must take place 
ina plane through O and we may take polar coordinates 7, 6 (centred on O) to specify the 
position of P in the plane of motion. In terms of these coordinates, 


Lo =rx(mv) =m(rr)x (FF + (r6) 6) = mr76n, 
where the constant unit vector n (= 7 x@) is perpendicular to the plane of motion. Hence, 
in this case, conservation of Lg is equivalent to conservation of Lo - n, the angular 
momentum of P about the axis {O, n}. The conclusion is then that the quantity 


Lo -n=mr6 =; 


where L is a constant. This important result was obtained in Chapter 7 by integrating 
the azimuthal equation of motion for P. We now see that it is a consequence of angular 
momentum conservation and that the constant L is the angular momentum* of P about 
the axis {O, n}. 


* The constant L used in Chapter 7 was actually the angular momentum per unit mass. 
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FIGURE 11.5 The particle slides on the table while the string is pulled 
down through the hole. 


Conservation of angular momentum about an axis 

Even when K 4 ¥ 0 it is still possible for angular momentum to be conserved about a 
particular axis through A. Let n be a fixed unit vector and A a fixed point so that {A, n} 
is a fixed axis through the point A. Then 


d dL d dL 
Oasis nes Sa 
dt dt 


Hence, if K 4 - n = Oat all times, it follows that L, - n is conserved. This result can be 
stated as follows: 


Conservation of angular momentum about an axis 


If the external forces acting on a system have no total moment about a fixed axis, 
then the angular momentum of the system about that axis is conserved. The same 
applies for a moving axis which passes through G and maintains a constant direc- 
tion. 


In our first example, angular momentum conservation is sufficient to determine the 
entire motion. 


Example 11.7 Pulling a particle through a hole 


A particle P of mass m can slide on a smooth horizontal table. P is connected to a 
light inextensible string which passes through a small smooth hole O in the table, so 
that the lower end of the string hangs vertically below the table while P moves on top 
with the string taut (see figure 11.5). Initially the lower end of the string is held fixed 
with P moving with speed wu on a circle of radius a. The string is now pulled down 
from below in such a way that the string above the table has the length r(¢) at time f. 
Find the velocity of P and the tension in the string at time ¢. 


Solution 


We must first establish that some component of angular momentum is conserved in 
this motion. The forces acting on P are gravity, the normal reaction of the smooth 
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table, and the tension in the string. Since the first two are equal and opposite and the 
tension force points towards O, it follows that KgQ = 0. Thus, however the string is 
pulled, Lo is conserved in the motion of P. 

Now we must calculate Lg. As in the case of central field orbits, Lg is perpen- 
dicular to the plane of motion and conservation of Lg is equivalent to conservation 
of the axial angular momentum Lg - k. Hence, as in orbital motion, 


Lo -k=mr?6=L, 


where the constant L is given by the initial conditions to be L = mau. Hence, in the 
motion of P, the conservation equation 


mr26 = mau 


is satisfied. Since r(¢) is given, this equation is sufficient to determine the motion of 
P. In particular, the velocity of P at time ¢ is given by 


a~ 


v=rFrt+(rO)O=7F+ (=) 0, 


from which we see that the transverse velocity of P tends to infinity as r tends to zero. 
The string tension T can be found from the radial equation of motion for P, 
namely, 


which gives 


= Ae — 42) — anu 28s 
T=m({rée r)=m 5 r). 
7 


For example, in order to pull the string down with constant speed, the applied tension 
must be 


This tends rapidly to infinity as r tends to zero, making it impossible to pull the 
particle through the hole! 


Our second example belongs to a class of problems that could be called ‘before and 


after problems’. We have encountered the same notion before. In elastic collision prob- 


lems, the linear momentum and energy of the system are conserved and these conservation 


laws are used to relate the initial state of the system (before) to the final state (after). This 


provides information about the final state that is independent of the nature of the parti- 


cle interaction. Conservation of angular momentum can be exploited in the same way. 


In the following example, angular momentum conservation is sufficient to determine the 


final state uniquely. 
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I 
Final state 


Initial state 


FIGURE 11.6 The beetle and the ball: the ball is smoothly pivoted about a vertical diameter and the 
beetle crawls on the surface of the ball. 


Example 11.8 The beetle and the ball 


A uniform ball of mass M and radius a is pivoted so that it can turn freely about one 
of its diameters which is fixed in a vertical position. A beetle of mass m can crawl 
on the surface of the ball. Initially the ball is rotating with angular speed Q with 
the beetle at the ‘North pole’ (see Figure 11.6 (left)). The beetle then walks (in any 
manner) to the ‘equator’ of the ball and sits down. What is the angular speed of the 
ball now? 


Solution 

We must first establish that some component of angular momentum is conserved. 
The external forces acting on the system of ‘beetle and ball’ are shown in Figure 11.6 
(centre). The forces X and Y are the constraint forces exerted by the pivots. The total 
moment of the external forces about O is therefore 


Ko = 0x (—Mgk) + rx (—mgk) + (ak) x X + (—ak)xY. 
It follows that 
Ko -k=0, 


since all the resulting triple scalar products contain two k’s. Hence Lo - k, the angu- 
lar mometum of the system about the rotation axis, is conserved, irrespective of the 
wanderings of the beetle. It follows that this axial angular momentum is the same 
after as it was before. 

In the initial state, the angular momentum of the ball about its rotation axis is 
given by J Q, where J = 2Ma?/5. Initially the beetle has zero velocity so its angular 
momentum is zero. Hence the initial value of the axial angular momentum is 


Lo- k= (3Ma’) Q. 


In the final state the ball has an unknown angular velocity Q’ and axial angular 
momentum 2Ma?Q'/5. The velocity of the beetle is entirely azimuthal and is equal 
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to Q’a. Hence, on using the formula (11.6), the axial angular momentum of the beetle 
is given by mp(v + @) = ma(Qa). The final value of the axial angular momentum is 
therefore 


Lo-k= (3Ma’) Q! + ma(Qla) = 12M + 5m)a?a". 
Since Lo - k is known to be conserved it follows that 
(2M + 5m)a°Q! = 2Ma’Q, 


and hence the final angular velocity of the ball is 


Y= ala Q 
~ \2M +5m 


Question Change in kinetic energy 
Find the change in kinetic energy of the system caused by the beetle’s journey. 


Answer 


The initial and final kinetic energies of the system are 
5 (3Ma") Q? and 4 (3Ma’) a? 4 tm(aQ’)? 


respectively. On using the value of 9’ found above and simplifying, the kinetic 
energy of the system is found to decrease by 


mMa2Q2 . 
2M +5m- 


Question Red hot beetle 


Does this loss of energy mean that the beetle arrives in a red-hot condition? 


Answer 


Your mechanics lecturer will be pleased to answer this question. 


Our last example, the spherical pendulum, is a system with two degrees of freedom. 
By using both angular momentum and energy conservation, a complete solution can be 
found. 


Example 11.9 The spherical pendulum: an integrable system 


A particle P of mass m is suspended from a fixed point O by a light inextensi- 
ble string of length a and moves with the string taut in three-dimensional space (the 
spherical pendulum). Show that angular momentum about the vertical axis through 
O is conserved and express this conservation law in terms of the generalised coor- 
dinates 6, ¢, as shown in Figure 11.7. Obtain also the corresponding equation for 
conservation of energy. 
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External forces Velocity diagram 


FIGURE 11.7 The spherical pendulum with generalised coordinates 6 and @. Left: the external 
forces. Right: the velocity diagram. 


Initially the string makes an acute angle a with the downward vertical and the 
particle is projected with speed u in a horizontal direction at right angles to the string. 
Determine the constants of the motion, and deduce an equation satisfied by @(t) in 
the subsequent motion. 


Solution 


The external forces on the particle are gravity and the tension in the string (see Figure 
11.7 Cleft)). Hence, 


Ko =rx(—mgk)+rxT = —merxk, 
the second term being zero since r and T are parallel. It follows that 
Ko-k=-meg(rxk)-k =0, 


since the triple scalar product has two k’s. Hence Lo - k is conserved. 

In order to express this conservation law in terms of coordinates, we draw a veloc- 
ity diagram for the system as explained in section 10.10. The velocities corresponding 
to the coordinates 6 and @ are aé and pd (= (asin 6)p) respectively in the direc- 
tions shown in Figure 11.7 (right). These two velocities are perpendicular, with the 
(a sin 0)p contribution in the azimuthal direction around the vertical axis {O, k}. It 
follows that v - ¢ = (asin0)@. The required axial angular momentum is therefore 


Lo-k=mp(v- ¢) = m(asin6@)(a sind ¢) = ma? sin” Od 
and conservation of Lo - k is expressed by 
ma’ sin’ 6 db = bh; 


where the axial angular momentum L is a constant of the motion. 
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The diagram also shows that the kinetic energy of P is given by 
sm ((ad)? + (asin@g 6) 


and the potential energy by V = —mg(acos@). Conservation of energy therefore 
requires that 


1m ((a6)? + (asin@ 6”) — mg(acos6) = E, 


where the total energy EF is a constant of the motion. 
From the prescribed initial conditions, 


L=m(asina)u, £= smu — mgacosa, 


so that the subsequent motion of P satisfies the conservation equations 


ma’ sin? 6 @ = ma sina u, (11.16) 


sm (06? +a’ sin? 6 é) —mgacosé = smu” —mgacosa. (11.17) 


Since the spherical pendulum has two degrees of freedom, these two conservation 
equations are sufficient to determine the motion. Moreover, the system is inte- 
grable (see section (10.10) so that it must be possible to reduce the solution of the 
problem to integrations. 

The equations (11.16), (11.17) are a pair of coupled first order ODEs for the 
unknown functions 6(t), @(t). However, because the coordinate ¢@ only appears as d 
in both equations, @ can be eliminated (6 can not!). From equation (11.16) we have 


= usina (11.18) 
~ asin? 6 : 


and this can now be substituted into equation (11.17) to obtain an equation for @(ft) 
alone. After some algebra we find that 


uz (oe “t). (11.19) 


42 
6° = — (cosa — cos) 
a? sin? 0 ur 


which is the required equation satisfied by 6(t). On taking square roots, this equa- 
tion becomes a first order separable ODE whose solution can be written as an inte- 
gral. Now that @(¢) is ‘known’, #(t) can be found (as another integral) from equation 
(11.18). Thus, as predicted, the solution has thus been reduced to integrations. lH 


Question Form of the motion 
This is all very well, but what does the motion actually look like? 
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FIGURE 11.8 The calculated path of the spherical pendulum for the case « = 2/6 and u*/ag = 1.9. 
Left: After four oscillations of 6. Right: After ten oscillations of 6. The surrounding boxes show the 
perspective. 


Answer 


Despite the problem being called integrable, the integrals arising from the separation 
procedure cannot be evaluated and no explicit solution is possible. However, equation 
(11.19) has the form of an energy equation for a system with one degree of freedom. 
We have met this situation before with the radial motion equation in orbit theory and 
the deductions we can make are the same. Because the left side of (11.19) is positive, 
it follows that the motion is restricted to those values of @ that make the function 


6 2 
F = (cosa — cos@) (a 2) 


sin? @ To 
positive. Moreover, maximium and minimum values of @ can only occur when 
F(@) =0. 

Since F(a) = 0, 6 = @ is one extremum” and any other extremum must be a 
root of the equation G(@) = 0, where 


cosa+cos@ 2ag 
< 


sin? 0 u 


Whether @ is a maximum or minimum point of 6 depends on the value of the initial 
projection speed uw. On differentiating equation (11.19) with respect to ft, we find that 
the initial value of 0 is given by 


ij u2 (cosa ag 
lone — a2 2 ee 


sin* a Uu 


* This is because of the form of the initial conditions. 
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so that @ initially increases if u2/ag > sin’ a/ cosa, and @ initially decreases if 
u-/ag < sin? w / cosa. (The critical case corresponds to the special case of conical 
motion.) Suppose that the first condition holds. Then 6min = a@ and Omax must be a 
root of the equation G(@) = 0. Since G(@) > 0 and G(z — a) < 0, such a root does 
exist and is less than zm — a. 

For example, consider the particular case in which a = 2/3 and u? = 4ag. Then 
the equation G(@) = 0 simplifies to give 


cos (cos @ + 2) = 0, 


from which it follows that @max = 2/2. Hence, in this case, @ oscillates periodically 
in the range 7/3 < 0 < m/2. 

At the same time as the coordinate @ oscillates, the coordinate @ increases in 
accordance with equation (11.18). Hence, during each oscillation period t of the 
coordinate 0, @ increases by 


usina [* dt 
®= a5 
a 0 sin* 0 


This pattern of motion repeats itself with period t, but the motion is only truely peri- 
odic if it eventually links up with itself; this occurs only when the initial conditions 
are such that ®/z is a rational number. 

Figure 11.8 shows an actual path of the spherical pendulum, corresponding to the 
initial conditions a = 2/6 and u?/g = 1.9. The results are entirely consistent with 
the theory above. Hl 


11.6 PLANAR RIGID BODY MOTION 


What is planar motion? 


Planar motion is a generalisation of two-dimensional motion in which two-dimensional 
methods are still valid. The complications of the full three-dimensional theory (repre- 
sented by the two vector equations (11.15)) melt away to leave three scalar equations, 
which often have a very simple form. This enables a variety of fascinating problems to be 
solved in simple closed form. Planar rigid body motion is good value for money! 

A system is said to be in planar motion if each of its particles moves in a plane 
and all of these planes are parallel to a fixed plane P called the plane of motion. For 
example, any purely translational motion of a rigid body in which G moves in the plane 
P is a planar motion, as is any purely rotational motion about an axis through G that is 
perpendicular to P. The same is true when both of these motions are present together. For 
example, a cylinder (of any cross-section) rolling down a rough inclined plane is in planar 
motion. It is not necessary for the bodies that make up our system to be cylinders, nor 
even to be bodies of revolution. We merely suppose that each constituent of the system 
should have reflective symmetry in the plane of motion,* as shown in Figure 11.9. 


* The reason for this symmetry restriction is that, if a body of completely general shape (a potato, say) were 
started in planar motion and moved under realistic forces (uniform gravity, say), then the motion would 
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FIGURE 11.9 Three typical elements of a system in planar motion. 
The particle P moves in the plane of motion y = 0; the elliptical 
crank rotates about the fixed axis {O, j}; and the circular pulley is in 
general planar motion. In the last case, G moves in the plane of 
motion, and the pulley also rotates about the axis {G, j}. 


Question Bodies in planar motion 
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Decide whether the following rigid bodies are in planar motion: (i) a cotton reel 
rolling on a table, (11) the Earth, and (iii) a snooker ball rolling after being struck with 


‘side’. [If you don’t know what this means, get a player to show you.] 


Answer 


(i) Yes. (i1) No, because the Earth’s rotation axis is not perpendicular to the plane of 
its orbit. (iii) No, because the rotation axis of the ball is not horizontal when the ball 


is struck with ‘side’. 


Angular momentum in planar motion 


The fact that makes planar motion so special is that the total angular momentum of each 


rigid body in the system has a constant direction normal to the plane of motion. This 


follows from the reflective symmetry that each body has in the plane of motion. In other 


words, if A is any point lying in the plane of motion, then the angular momentum of each 


rigid body about A has the simple form 


La=Laj, 


where, as shown in Figure 11.9, the plane of motion has been taken to be y = 0, and L4 


is a short form for the axial angular momentum L , - j. Similar remarks apply to the total 


moment about A of the external forces acting on each rigid body. This follows from the 


not remain planar. However, if the system and the external forces have reflective symmetry in the plane 


of motion, then a motion that is initially planar will remain planar. 
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supposed symmetry of these forces about the plane of motion. Hence the total moment 
about A of the external forces acting on each rigid body has the form 


K,z=Kaj, 


where K y is a short form for the axial moment K , - j. If A is a fixed point, or the centre 
of mass of the body, the angular momentum principle for each rigid body then takes the 
form 


d 
—(Laj)=KaJ 
Fra AJ) AJ 


which, since j is a constant vector, reduces to the scalar equation 


dLa 
— = Kz,. 
dt 7 
Since the axis of rotation of the body in its motion relative to A is also normal to the 
plane of motion, it follows from equation (11.8) that 


La=La-j=Iy,a, 


where w is the angular velocity of the body, and /,4 its moment of inertia, about the axis 
{A, j}. We therefore obtain the planar angular momentum principle in the form 


d 
—(U = Ka, 11.20 
Au AW) A ( ) 


where A is either some fixed point in the plane of motion, or the centre of mass of the 
body. In applications, the moment of inertia /4 is usually constant. 


Planar rigid body equations 


We are now in a position to reduce the full rigid body equations (11.15) to planar form. 
Since there is no motion in the j-direction, only the i- and k-components of the linear 
momentum principle survive, and, as we showed in the last section, only the j-component 
of the angular momentum principle survives. Thus, each rigid body in planar motion 
satisfies the three scalar equations of motion: 


Planar rigid body equations 


ay i r (11.21) 
Soe Mo2Se Te are 


dt “ato dt 


where we have taken the angular momentum principle about the centre of mass G; /g is 
then constant. These are the planar rigid body equations. 
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FIGURE 11.10 The pulley rotates about the Vv 
fixed horizontal axis {O, j} and the { 
suspended particle moves vertically. mg 


A special case arises when the body is rotating about a fixed axis like the elliptical 
crank shown in Figure 11.9. Although the equations (11.21) could still be used, this is not 
the quickest way. Let the fixed axis be {O, j}, where O lies in the plane of motion (see 
Figure 11.9). If the angular momentum principle is now applied about O (instead of G), 
then the unknown reactions exerted by the pivots make no contributuion to Kg and do not 
appear in the third equation of (11.21). The first two equations in (11.21) serve only to 
determine these reactions, once the motion has been calculated. Hence, unless the pivot 
reactions are actually required, it is sufficient to use the single equation: 


Rigid body equation — fixed axis 
ae (11.22) 


In this case also, the moment of inertia /¢ is constant. 
Example 11.10 Planar motion: mass hanging from a pulley 


A circular pulley of mass M and radius a is smoothly pivoted about the axis {O, j}, 
as shown in Figure 11.10. A light inextensible string is wrapped round the pulley 
so that it does not slip, and a particle of mass m is suspended from the free end. 
The system undergoes planar motion with the particle moving vertically. Find the 
downward acceleration of the particle. 


Solution 

This problem is most easily solved by using energy conservation, but it is instruc- 
tive to solve it as a planar motion problem to illustrate the difference between the 
two approaches. In the energy conservation approach, the whole system is considered 
to be a single entity, and in this case, the string tensions do no fotal work and need 
not be considered. In the setting of planar motion however, the system consists of 
two elements, (i) a particle moving in a vertical straight line, and (ii) a rigid pulley 
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rotating about a fixed horizontal axis. For each constituent, the tension force exerted 
by the string is external. The string tensions (which are equal in this problem) there- 
fore appear in the equations of motion. For this reason, the conservation method is 
simpler, but there are many problems where there is no useful conservation principle 
and the planar motion approach is essential. 

Let the particle have downward vertical velocity v, and the pulley have angular 
velocity w (in the sense shown) at time f. Since the vector j points into the page, 
this is the positive sense around the axis {O, j}. Thus, in the notation used here, the 
positive sense for angular velocity is clockwise. The same applies to moments and 
angular momenta. 

First consider the motion of the particle. Since the motion is in a vertical straight 
line, the only surviving euation is 


dv r 
m— =meg-T, 

dt = 
where T is the string tension at time f. 


Now consider the motion of the pulley. Since this is rotating about a fixed axis, 
the equation of motion is [gdw/dt = Kg, that is, 


since the weight force M g and the pivot reaction Z have zero moment about O. 
Hence, the unknown tension T can be eliminated to give 


This equation applies whether or not the string slips on the pulley. However, since 
we are given that the string does not slip, v and w must be related by the no-slip 
condition v = wa. On using this condition in the last equation, we obtain 


dv _ ma? 

dt \ma2+Io 2 
This is the downward acceleration of the particle. If the moment of inertia of the 
pulley is 5M a’, then the value of this acceleration is [2m/(2m + M)]g. 


Our next example, the cotton reel problem, is a famous problem in planar mechanics. 


The mathematics is elementary, but the solution needs to be interpreted carefully. 


Example 11.11 The cotton reel problem 


A cotton reel is at rest on a rough horizontal table when the free end of the thread is 
pulled horizontally with a constant force T, as shown in Figure 11.11. Given that the 
reel undergoes planar motion,* how does it move? 


* Tn practice, it is impossible to maintain planar motion in the problem as described (try it). However, the 
problem is the same if the thread is replaced by a broad flat tape for which planar motion is easier to 
achieve. 
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FIGURE 11.11 The reel is initially at rest on a rough horizontal table, 
when the free end of the thread is pulled with a constant force. 


Solution 
Suppose the ends of the reel have radius a, the axle wound with thread has radius 
b (with b < a), and the whole reel together with its thread has mass M. (We will 
neglect the mass of any thread pulled from the reel.) Let the reel have horizontal 
velocity V and angular velocity w in the directions shown at time ¢. Note that we are 
not presuming the variables V, w (or F) must take positive values. The signs of these 
variables will be deduced in the course of solving the problem. 

The external forces on the reel are the string tension T and the fricton force F at 
the table. (The weight force and the normal reaction at the table cancel.) Hence, the 
equations of motion for the reel are 


dV 
—=T -F, (11.23) 
d 

Me = aF —DbT, (11.24) 


where M k? is the moment of inertia of the reel about {G, j}. We are not making any 
prior assumption about whether the reel slides or rolls. We will simply presume that 
the friction force F' is bounded in magnitude by some maximum F'™*, that is, 


—F™* < F< FM, (11.25) 


and that F = +F™* (or —F'™*) when the reel is sliding forwards (or backwards). 

Whether the reel slides or rolls depends on v©, the velocity of the contact particle 
C. Since v° = V — aa, it follows by manipulating the equations (11.23), (11.24) that 
v© satisfies the equation 


Mi’ \ 4 _p_yF (11.26) 
2 nah eet a 


where the constant y is given by 


(11.27) 
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Different cases arise depending on how hard one pulls on the thread. 


Strong pull T > yF™* 


In this case, the right side of equation (11.26) is certain to be positive so that 
dv© /dt > 0 for all t. Since the system starts from rest, v- = 0 initially and so 
v© > 0 for all t > 0. In other words, the reel slides forwards. This in turn implies 
that F = F™* so that the equations of motion (11.23), (11.24) become 


dV T-—F™* 


7] 


dt M 
daw = aP™x _ 5T 
di Mk2 * 


These equations imply that the reel slides forwards with constant acceleration and 
constant angular acceleration. Note that w is positive for yF™™* <T < (a/b)F™ 
and negative for T > (a/b)F™*., 


Gentle pull T < yF™* 
In this case, the reel must roll. The proof of this is by contradiction, as follows. 


Suppose that the reel were to slide forwards at any time in the subsequent motion. Then 


there must be a time Tr, at which v© and dv© /dt are both positive. The condition vf 


> 0 implies 
that F = F™% when t = t, and the condition dv© /dt > 0 then implies that T > y F™* when 
t = t. This is contrary to asumption and so forward sliding can never take place. A similar 


argument excludes backward sliding and so the only possibility is that the reel must roll. 


The reel must therefore satisfy the rolling condition V = wa and this, 
together with the equations of motion (11.23), (11.24), implies that the reel must 
roll forwards with constant acceleration 


dV a(a — b)T 


= .i 
dt M(k2+a2) 


Example 11.12 A circus trick 


In a circus trick, a performer of mass m causes a large ball of mass M and radius a to 
accelerate to the right (see Figure 11.12) by running to the left on the upper surface of 
the ball. The man does not fall off the ball because he maintains this motion in such 
a way that the angle aw shown remains constant. Find the conditions neccessary for 
such a motion to take place. 


Solution 

Suppose the motion is planar and that, at time ¢, the ball has velocity V in the i- 
direction and angular velocity w (= V/a) around the axis {O, j}. If the man is to 
maintain his position on the ball, then he must run up the surface of the ball (towards 
the highest point) with velocity V. This maintains his vertical height and his acceler- 
aton is then the same as that of the ball, namely (dV /dt)i. 
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FIGURE 11.12 The circus trick: forces on the ball and the performer. 


The equations of motion for the man are therefore 


dV . 
sora = Nosina — Fo cosa 


0 = No cosa + Fz sina — mg 


and the equations of motion for the ball are 


dV . 
ars = F; — Nosina+ Fo cosa 


0 = N; — Nocosa — Fzsina — Mg 
Ce ee 
—=aF)-a 
on 2 1 
where Ig is the moment of inertia of the ball about {O, 7}. 
These five equations, together with the rolling condition V = wa, are sufficient to 


determine the six unknowns dV /dt, dw/dt, F,, Ni, Fz and N2. After some algebra, 
the solution for the forward acceleration of the ball turns out to be 


dV _ mg sina 
dt M-+(Io/a2)+m(1+cosa) 


Hence the motion is possible for any acute angle a provided that the performer can 
accelerate relative to the ball with this acceleration. For the case in which the ball is 
hollow, the masses of the man and the ball are equal, and wa = 45°, the acceleration 
required is approximately 0.21 g. M 


11.7 RIGID BODY STATICS IN THREE DIMENSIONS 


Although we are not yet able to attempt problems in which a rigid body under- 
goes three-dimensional motion, we are able to solve problems in which a rigid body is 
in equilibrium under a three-dimensional system of forces. In equilibrium, the linear and 
angular momentum of the body are known to be zero, so that the rigid body equations 
become 
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FIGURE 11.13 The rectangular panel ABC D = 
rests on the rough floor z = 0 and leans > x 
against the smooth wall x = 0. A 
Equations of rigid body statics 
(11.28) 


F=0 K,=0 


where A is any fixed point of space.* In other words, when a system is in equlibrium, the 
resultant force and the resultant moment of the external forces must be zero. 

Since there is no motion, one may wonder what there is left to calculate in statical 
problems. However, the body is usually supported or restrained in some prescribed way, 
and it is the unknown constraint forces that are to be determined. If these constraint forces 
can be determined solely from the equilibrium equations (11.28), then the problem is said 
to be statically determinate.‘ 


Example 11.13 Leaning panel 


A rectangular panel ABC D of mass M is (rather carelessly) placed with its edge AB 
on the rough horizontal floor z = 0 and with the vertex D resting against the smooth 
wall x = 0, as shown in Figure 11.13. The four vertices of the panel are at the points 
A(2, 0,0), B(6, 4,0), C(4, 6, 6) and D(0, 2, 6) respectively. Given that the panel 
does not slip on the floor, find the reaction force exerted by the wall. 


Solution 

The external forces acting on the panel are the normal reaction of the wall Pi, the 
weight force —M gk, and the reaction of the floor on the edge AB. Now the reaction 
of the floor is distributed along the edge AB and, although we could treat it as such, 


* Tt is unneccessary and inconvenient to restrict moments to be taken about G. 

* Not all problems are statically determinate by any means. In the two-dimensional theory, if a heavy 
rigid plank is resting on three or more supports, then the individual reactions at the supports cannot be 
found from the equlibrium equations. What this means is that modelling the plank as a rigid body is 
not appropriate in such a problem. One should instead model the plank as a deformable body, solve the 
problem using the theory of elasticity, and then pass to the rigid limit. 
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FIGURE 11.14 The rod AB is in equilibrium k 
with A on a rough floor and B resting against —Mg 
a smooth wall. The rod is prevented from wa 
falling by the string BC. A 


this is an irrelevant complication. We will therefore suppose that (in order to avoid 
damage to the floor) the panel has been supported on two small pads beneath the 
comers A and B, in which case the reaction of the floor consists of the forces X and 
Y as shown. 

We now apply the equilibrium conditions. The condition F = 0 yields 


Ni+X+Y—Mgk=0. (11.29) 


If we take moments about the corner A, the reaction X makes no contribution and the 
condition K 4 = 0 becomes 


AD x(Ni)+ AB xY+ AG x(—Mgk) = 0. 
On inserting the given numbers, this equation becomes 
N(O6j—-—2k)+ 4i+4j7)xY —Mg GBi- j)=9. (11.30) 


The six scalar equations in (11.29), (11.30) contain the seven scalar unknowns X, 
Y,N which means that the problem is not statically determinate. However, this does 
not stop us from finding N, since Y can be eliminated from equation (11.30) by taking 
the scalar product with the vector 41+4 7. (This is equivalent to taking moments about 
the axis AB so that both X and Y disappear to leave N as the only unknown.) This 
gives the reaction exerted by the wall to be N = Mg/3. @ 


Example 11.14 Leaning rod 


A rough floor lies in the horizontal plane z = 0 and the plane x = 0 is occupied by a 
smooth vertical wall. A uniform rod of mass M has its lower end on the floor at the 
point (a, 0, 0) and its upper end rests in contact with the wall at the point (0, b, c). The 
rod is prevented from falling by having its upper end connected to the point (0, 0, c) 
by a light inextensible string. Given that the rod does not slip, find the tension in the 
string and the reaction exerted by the wall. 
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Solution 


The external forces acting on the rod are the normal reaction Ni of the wall, the 
tension force —Q j in the string, the weight force —M gk, and reaction X of the floor. 
The equilibrium equations are therefore 


Pi—-Oj—Mgk+X=0, (11.31) 
(2Ln) x (Pi — Qj) + (Ln)x(—Mgk) = 0, (11.32) 


where we have taken moments about A to eliminate the reaction X. Here, 2L is the 


length of the rod, and n is the unit vector in the direction AB : 

Equation (11.31) serves only to determine the reaction X once P and Q are 
known. To extract P and Q from the vector equation (11.32), we take components 
in any two directions other than the n-direction; the easiest choices are the i- and 
j-directions. On taking the scalar product of equation (11.32) with i, we obtain 


0=2[n, Pi-— Qj,i]+ [n, —Mgk, i] 
= 2P [n,i,i] —2Q[n, j,i] — Mg [n, k, i] 
=0-2Qn-(jxi) — Mgn- (kxi) 
= 20 (n-k)—Mg(n- j), 


where we have used the notation [u, v, w] to mean the triple scalar product of the 
vectors u, v and w. Hence 


_ Mg(n- j) 
o= 2(n-k) ’ 


and, by taking the scalar product of equation (11.32) with j and proceed in the same 
way, we obtain 


_ Mg (n-i) 
2(n-k) - 


Finally, we need to express these answers in terms of the data given in the ques- 
tion. Since the unit vector n is given by 


_ ~ai+bj+ck 
a aL ; 


it follows that the reaction exerted by the wall, and the tension in the string, are 


M Mgb 
P= = O= a | 
2c 2c 
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Problems on Chapter 11 


Answers and comments are at the end of the book. 


Harder problems carry a star (*). 


11.1 Non-standard angular momentum principle If A is a generally moving point of space 
and L 4 is the angular momentum of a system S about A in its motion relative to A, show that 
the angular momentum principle for S about A takes the non-standard form 


dLa da 
=K M(R : 
dt . ( aM dt? 


[Begin by expanding the expression for L 4.] 
When does this formula reduce to the standard form? [This non-standard version of the 
angular momentum principle is rarely needed. However, see Problem 11.9.] 


Problems soluble by conservation principles 


11.2 A fairground target consists of a uniform circular disk of mass M and radius a that can 
turn freely about a diameter which is fixed in a vertical position. Initially the target is at rest. 
A bullet of mass m is moving with speed u along a horizontal straight line at right angles to 
the target. The bullet embeds itself in the target at a point distance b from the rotation axis. 
Find the final angular speed of the target. [The moment of inertia of the disk about its rotation 
axis is Ma”/4.] 

Show also that the energy lost in the impact is 


me Ma? 
mu - 
2 Ma? + 4mb? 


11.3 A uniform circular cylinder of mass M and radius a can rotate freely about its axis of 
symmetry which is fixed in a vertical position. A light string is wound around the cylinder so 
that it does not slip and a particle of mass m is attached to the free end. Initially the system 
is at rest with the free string taut, horizontal and of length b. The particle is then projected 
horizontally with speed uw at right angles to the string. The string winds itself around the 
cylinder and eventually the particle strikes the cylinder and sticks to it. Find the final angular 
speed of the cylinder. 


11.4 Rotating gas cloud A cloud of interstellar gas of total mass M can move freely in space. 
Initially the cloud has the form of a uniform sphere of radius a rotating with angular speed Q 
about an axis through its centre. Later, the cloud is observed to have changed its form to that 
of a thin uniform circular disk of radius b which is rotating about an axis through its centre and 
perpendicular to its plane. Find the angular speed of the disk and the increase in the kinetic 
energy of the cloud. 


11.5 Conical pendulum with shortening string A particle is suspended from a support by a 
light inextensible string which passes through a small fixed ring vertically below the support. 
Initially the particle is performing a conical motion of angle 60°, with the moving part of the 
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string of a. The support is now made to move slowly upwards so that the motion remains 
nearly conical. Find the angle of this conical motion when the support has been raised by a 
distance a/2. [Requires the numerical solution of a trigonometric equation. ] 


11.6 Baseball bat A baseball bat has mass M and moment of inertia Mk? about any axis 
through its centre of mass G that is perpendicular to the axis of symmetry. The bat is at rest 
when a ball of mass m, moving with speed u, is normally incident along a straight line through 
the axis of symmetry at a distance b from G. Show that, whether the impact is elastic or not, 
there is a point on the axis of symmetry of the bat that is instantaneously at rest after the impact 
and that the distance c of this point from G is given by bc = k?. In the elastic case, find the 
speed of the ball after the impact. [Gravity (and the batter!) should be ignored throughout this 
question. ] 


11.7 Hoop mounting a step A uniform hoop of mass M and radius a is rolling with speed 
V along level ground when it meets a step of height h (h < a). The particle C of the hoop 
that makes contact with the step is suddenly brought to rest. Find the instantaneous speed of 
the centre of mass, and the instantaneous angular velocity of the hoop, immediately after the 
impact. Deduce that the particle C cannot remain at rest on the edge of the step if 


2 ky 
V* > (a-hA)g({1-—-— ; 
2a 


Suppose that the particle C does remain on the edge of the step. Show that the hoop will 
go on to mount the step if 
2 iy 
Vi > hg {l-— ‘ 
2a 


Deduce that the hoop cannot mount the step in the manner described if h > a/2. 


11.8 Particle sliding on acone A particle P slides on the smooth inner surface of a circular 
cone of semi-angle a. The axis of symmetry of the cone is vertical with the vertex O pointing 
downwards. Show that the vertical component of angular momentum about O is conserved in 
the motion. State a second dynamical quantity that is conserved. 

Initially P is a distance a from O when it is projected horizontally along the inside surface 
of the cone with speed u. Show that, in the subsequent motion, the distance r of P from O 
satisfies the equation 


Dilys 
P= (r —a) |“ <t ad 2g cosa. 


Case A For the case in which gravity is absent, find and the azimuthal angle ¢ explicitly 
as functions of t. Make a sketch of the path of P (as seen from ‘above’) when a = 7/6. 


Case B For the case in which a = 77/3, find the value of u such that r oscillates between 
a and 2a in the subsequent motion. With this value of u, show that 7 will first return to the 
value; = a after a time 


1/2 72 
HAO) fae ae 
8 1 (6 -DA@—§)2+ 38)] 
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11.9% Bug running on a hoop A uniform circular hoop of mass M can slide freely on a 
smooth horizontal table, and a bug of mass m can run on the hoop. The system is at rest when 
the bug starts to run. What is the angle turned through by the hoop when the bug has completed 
one lap of the hoop? [This is a classic problem, but difficult. Apply the angular momentum 
principle about the centre of the hoop, using the non-standard version given in Problem 11.1] 


Planar rigid body motion 


11.10 General rigid pendulum A rigid body of general shape has mass M and can rotate 
freely about a fixed horizontal axis. The centre of mass of the body is distance h from the 
rotation axis, and the moment of inertia of the body about the rotation axis is 7. Show that the 
period of small oscillations of the body about the downward equilibrium position is 


1 \1 
m (Go). 
Mgh 
Deduce the period of small oscillations of a uniform rod of length 2a, pivoted about a horizon- 
tal axis perpendicular to the rod and distance b from its centre. 


11.11 From sliding to rolling A snooker ball is at rest on the table when it is projected 
forward with speed V and no angular velocity. Find the speed of the ball when it eventually 
begins to roll. What proportion of the original kinetic energy is lost in the process? 


11.12 Rolling or sliding? A uniform ball is released from rest on a rough plane inclined at 
angle a to the horizontal. The coefficient of friction between the ball and the plane is uw. Will 
the ball roll or slide down the plane? Find the acceleration of the ball in each case. 


11.13 A circular disk of mass M and radius a is smoothly pivoted about its axis of symmetry 
which is fixed in a horizontal position. A bug of mass m runs with constant speed u around the 
rim of the disk. Initially the disk is held at rest and is released when the bug reaches its lowest 
point. What is the condition that the bug will reach the highest point of the disk? 


11.14 Yo-yo with moving support A uniform circular cylinder (a yo-yo) has a light inexten- 
sible string wrapped around it so that it does not slip. The free end of the string is fastened to 
a support and the yo-yo moves in a vertical straight line with the straight part of the string also 
vertical. At the same time the support is made to move vertically having upward displacement 
Z(t) at time f. Find the acceleration of the yo-yo. What happens if the system starts from rest 
and the support moves upwards with acceleration 2g ? 


11.15 Supermarket belt A circular cylinder, which is axially symmetric but not uniform, has 
mass M and moment of inertia Mk? about its axis of symmetry. The cylinder is placed on a 
rough horizontal belt at right angles to the direction in which the belt can move. Initially the 
cylinder and the belt are both at rest when the belt begins to move with velocity V(t). Given 
that there is no slipping, find the velocity of the cylinder at time rf. 

Explain why drinks bottles tend to spin on a supermarket belt (instead of moving forwards) 
if they are placed at right-angles to the belt. 
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11.16% Falling chimney A uniform rod of length 2a has one end on a rough table and is 
balanced in the vertically upwards position. The rod is then slightly disturbed. Given that its 
lower end does not slip, show that, in the subsequent motion, the angle @ that the rod makes 
with the upward vertical satisfies the equation 


2a6* = 3g(1 — cos@). 


Consider now the the upper part of the rod of length 2ya, as shown in Figure 11.15. Let T, $ 
and K be the tension force, the shear force and the couple exerted on the upper part of the rod 
by the lower part. By considering the upper part of the rod to be a rigid body in planar motion, 
find expressions for S and K in terms of @. 

If a tall thin chimney begins to fall, at what point along its length would you expect it to 
break first? 


Rigid body statics 


11.17 Leaning triangular panel A rough floor lies in the horizontal plane z = O and the 
planes x = 0, y = 0 are occupied by smooth vertical walls. A rigid uniform triangular panel 
ABC has mass m. The vertex A of the panel is placed on the floor at the point (2, 2, 0) and the 
vertices B, C rest in contact with the walls at the points (0, 1, 6), (1, 0, 6) respectively. Given 
that the vertex A does not slip, find the reactions exerted by the walls. Deduce the reaction 
exerted by the floor. 


11.18 Triangular coffee table A trendy swedish coffee table has an unsymmetrical triangular 
glass top supported by a leg at each vertex. Show that, whatever the shape of the triangular 
top, each leg bears one third of its weight. 


11.19 Pile of balls Three identical balls are placed in contact with each other on a horizontal 
table and a fourth identical ball is placed on top of the first three. Show that the four balls 
cannot be in equilibrium unless (1) the coefficient of friction between the balls is at least J3 _ 
V2, and (ii) the coefficient of friction between each ball and the table is at least }(/3 — V2). 
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Lagrange’s equations 
and conservation principles 


KEY FEATURES 

The key features of this chapter are generalised coordinates and configuration space, the 
derivation and use of Lagrange’s equations, the Lagrangian, and the connection between 
symmetry of the Lagrangian and conservation principles. 


Lagrange’s equations mark a change in direction in our development of mechanics. 
Building on the work of d’Alembert, Lagrange* devised a general method for obtaining 
the equations of motion for a very wide class of mechanical systems. In earlier chap- 
ters we have used conservation principles for this purpose, but there is no guarantee that 
enough conservation principles exist. In contrast, Lagrange’s method is completely gen- 
eral and is not restricted to problems soluble by conservation principles. The method is 
so simple to apply that it is quite possible to solve complex mechanical problems whilst 
knowing very little about mechanics! However, the supporting theory has its subtleties. 

Lagrange’s equations also mark the beginning of analytical mechanics in which gen- 
eral principles, such as the connection between symmetry and conservation principles, 
begin to take over from actual problem solving. 


12.1 CONSTRAINTS AND CONSTRAINT FORCES 


A general mechanical system S consists of any number of particles P;, Po, ..., 
Py. The particles of S may have interconnections of various kinds (light strings, springs 
and so on) and also be subject to external connections and constraints. These could include 
features such as a particle being forced to remain on a fixed surface or suspended from a 


* Joseph-Louis Lagrange (Giuseppe Lodovico Lagrangia), (1736-1813). Although Lagrange is often con- 
sidered to be French, he was in fact born in Turin, Italy and did not move to Paris until 1787. Lagrange 
had a long career in Turin and Berlin during which time he made major contributions to mechanics, fluid 
mechanics and the calculus of variations. His famous book Mécanique Analitique, published in Paris in 
1788, is a definitive account of his contributions to mechanics. This work transformed mechanics into a 
branch of mathematical analysis. Perhaps to emphasise this, there is not a single diagram in the whole 
book! 
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FIGURE 12.1 The general mechanical system 
S consists of any number of particles {P;} 

(i = 1...,N). The typical particle P; has 
mass ™m;, position vector r; and velocity v;. 
F . is the specified force and F 4 the 
constraint force acting on Pj. 


fixed point by a light inextensible string. The pendulum, the spinning top, the bicycle and 
the solar system are examples of mechanical systems. 


Unconstrained systems 


If the particles of S are free to move anywhere in space independently of each other then 
S is said to be an unconstrained system. In this special case, the equations of motion for 
S are simply Newton’s equations for the N individual particles. Suppose that the typical 
particle P; has mass m;, position vector r; and velocity v;. Then the equations of motion 
for the system S are 


mv; = F; (Gi =1...,N), 
where F’; is the force acting on the particle Pj. 


Example 12.1 Two-body problem 


Write down the Newton equations for the two-body gravitation problem. 


Solution 

In this problem S consists of two particles moving solely under their mutual gravi- 
tational attraction. There are no constraints. The motion of the system is therefore 
governed by the two Newton equations 


2 —T) ri —12 


mv, =m\m2G m2v2 =m\m2G 


Fi relP 
where G is the constant of gravitation. These equations, together with the initial 
conditions, are sufficient to determine the motion of the two particles. Hl 


Constrained systems 


Unconstrained mechanical systems are relatively rare. Indeed many of the problems 
solved in earlier chapters involve mechanical systems that are subject to geometrical or 
kinematical constraints. Geometrical constraints are those that involve only the position 
vectors {r;}; kinematical constraints involve the {v;} as well. Some typical constraints 
are as follows: 
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The bob of a pendulum must remain a fixed distance from the point of support. 


The particles of a rigid body must maintain fixed distances from each other. 
e A particle sliding on a wire must not leave the wire. 


e The contact particle of a body rolling on a fixed surface must be at rest. 


The rolling condition is a kinematical constraint since it involves the velocity of a 
particle. All the other constraints are geometrical. 

These, and all other constraints, are enforced by constraint forces. Constraint forces 
are not part of the specification of a system and are therefore unknown. For example, 
when a particle is constrained to slide on a wire, it is prevented from leaving the wire 
by the force that the wire exerts upon it. This constraint force (which would commonly 
be called the reaction of the wire on the particle) is unknown; we know only that it is 
sufficient to keep the particle on the wire. 

For constrained systems the straightforward approach of using the Newton equations 
runs into the following difficulties: 


A_ The equations of motion do not incorporate the constraints 
The Newton equations (in Cartesian coordinates) do not incorporate the con- 
straints. These must therefore be included in the form of additional conditions 
to be solved simultaneously with the dynamical equations. 


B_ The constraint forces are unknown 
For constrained systems, the Newton equations have the form 


mjd; = FS + FC (1 <i <N), (12.1) 


where F 7 is the specified force and F a is the constraint force acting on the 
particle P;. The F 7 are known but the F c are not. 


Because of these two difficulties, only the simplest problems of constrained motion 
are tackled this way. In the following sections we show how these difficulties can be over- 
come. The first difficulty is overcome by using a new (reduced) set of coordinates called 
generalised coordinates, while the second is overcome by using Lagrange’s equations 
instead of Newton’s. 


12.2 GENERALISED COORDINATES 


Suppose that the system is subject to geometrical constraints only. Then the 
position vectors {r;} of its particles are not independent variables, but are related to each 
other by these constraints. A possible ‘position’ of such a system is called a configuration. 
More precisely, a set of values for the position vectors {r;} that is consistent with the 
geometrical constraints is a configuration of the system. 

The trick is to select new ‘coordinates’ that are independent of each other but are still 
sufficient to specify the configuration of the system. These new coordinates are called 
generalised coordinates and their official definition is as follows: 
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FIGURE 12.2 The variables x and @ are a set of generalised 
coordinates for this system. 


Definition 12.1 Generalised coordinates If the configuration of a system S is deter- 
mined by the values of a set of independent variables q,, ..., dn, then {q\, ..-, Gn} is said 
to be a set of generalised coordinates for S. 


This definition deserves some explanation. 


(i) 


(ii) 


When we say the generalised coordinates must be independent variables, we mean 
that there must be no functional relation connecting them. If there were, one of the 
coordinates could be removed and the remaining n — 1 coordinates would still deter- 
mine the configuration of the system. The set of generalised coordinates must not be 
reducible in this way. 

When we say the generalised coordinates gi, ..., ¢, determine the configuration of 
the system S, we mean that, when the values of the coordinates g1, ..., gy are given, 
the position of every particle of S is determined. In other words, the position vectors 
{r;} of the particles must be known functions of the independent variables q1,..., dn, 
that is, 


rj =Ti(Q1,---> dn) (a — a arene he (12.2) 


Abstract though this concept may seem, generalised coordinates are remarkably easy 


to use. In practice, they are chosen to be displacements or angles that appear naturally in 


the problem. This is illustrated by the following examples. 


Example 12.2 Choosing generalised coordinates 


Let S be the system shown in Figure 12.2 which consists of two particles P; and 
P> connected by a light rigid rod of length a. The particle P; is constrained to move 
along a fixed horizontal rail and the system moves in the vertical plane through the 
rail. Select generalised coordinates for this system and obtain expressions for the 
position vectors r;, r2 in terms of these coordinates. 


Solution 


Consider the variables x, 6 shown. These are certainly independent variables 
(they are not connected by any functional relation) and, when they are given, the 
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configuration of S is determined. Thus {x, 0} is a set of generalised coordinates for 
the system S. 

In terms of the coordinates x and 0, the positions of the particles P, and P2 are 
given by 


rj =xi, 
(x + asin@)i — (acos@)k, 


r2 
which are the expressions (12.2) for this system and this choice of coordinates. li 


Example 12.3 Choosing more generalised coordinates 


Choose generalised coordinates for the system consisting of three particles P;, P2, P3 
where P,, Pz are connected by a light rigid rod of length a and P2, P3 are connected 
by a light rigid rod of length b. The system slides on a horizontal table. [Make a 
sketch of the system.] 


Solution 

Many choices of generalised coordinates are possible. Let Oxyz be a system of 
rectangular coordinates with O on the table and Oz pointing vertically upwards. One 
set of generalised coordinates consists of (i) the x and y coordinates of the particle 
P,, (ii) the angle 6 between the line P; P2 and the x-axis, (iii) the angle @ between the 
line P2 P3 and the x-axis. A second set of generalised coordinates consists of (i) the 
x and y coordinates of the particle Po, (11) the angle 6 between the line P; Pz and the 
y-axis, (iii) the angle @ between the line P; P2 and the line P P3. 


Degrees of freedom 


It is evident from the above example that the configuration of a system can be specified 
by many different sets of generalised coordinates. However the number of coordinates 
needed is always the same. In the last example, the number of generalised coordinates 
needed is always three. 


Definition 12.2 Degrees of freedom Let S be a mechanical system subject to geo- 
metrical constraints. Then the number of generalised coordinates needed to specify the 
configuration of S is called the number of degrees of freedom of S. 


The number of degrees of freedom is an important property of a mechanical system. 
Suppose, for example, that we have a system with three degrees of freedom and gener- 
alised coordinates g1, g2, g3. Suppose also that the system is in some given configuration 
when it is started into motion in some given way. This means that we know the initial 
values of the coordinates q1, q2, 3, and their time derivatives g1, g2, g3. How many equa- 
tions of motion (second order ODEs) do we need to determine the functions qj (t), g2(t), 
q3(t) that describe the subsequent motion of the system? The answer is provided by the 
general theory of ODEs. If the three functions g1(t), g2(t), g3(¢) satisfy three (indepen- 
dent) second order ODEs, then the general theory guarantees that there is precisely one 
solution that satisfies the prescribed initial conditions. If there are fewer equations, the 
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solution is not uniquely determined; if there are more, the equations are not independent. 
This gives us the following important result: 


Degrees of freedom and equations of motion 


The number of degrees of freedom of a system is equal to the number of equations of 
motion (second order ODEs) that are needed to determine the motion of the system. 


Example 12.4 Degrees of freedom 


State the number of degrees of freedom of the following mechanical systems: (i) the 
simple pendulum, (ii) the spherical pendulum, (iii) a door swinging on its hinges, (iv) 
a bar of soap (a particle) sliding on the inside of a basin, (v) four rigid rods flexibly 
jointed to form a quadrilateral which can move on a flat table, (vi) a ball rolling on a 
rough table. 


Solution 
G1 Gi2 Gil Cv)2 (v)4 (vi) Not defined! This system has a kinematical 
constraint, namely, the rolling condition at the contact point. Hl 


Kinematical constraints 


So far we have not discussed kinematical constraints such as the rolling condition. We 
can now handle geometrical constraints since they are automatically taken into account by 
using generalised coordinates. But kinematical constraints involve the particle velocities 
which in turn depend not only on the coordinates q,,..., Gn, but also their time deriva- 
tives q1,...,9Gn- In general, kinematical constraints cannot be incorporated by selecting 
some new Set of generalised coordinates. As a result, such constraints have to remain as 
additional ODEs that must be solved along with the equations of motion. 

All is not lost however since, in some special but important cases, the ODE repre- 
senting the kinematical constraint can be immediately integrated to yield an equivalent 
geometrical constraint. Such a constraint is said to be integrable. 


Example 12.5 An integrable kinematical constraint 


A circular cylinder rolls down a rough inclined plane. Show that, in this problem, the 
rolling condition is an integrable constraint. 


Solution 

In the absence of the rolling condition, this system has two degrees of freedom; take 
as generalised coordinates x (the displacement of the cylinder axis down the plane) 
and 6 (the rotation angle of the cylinder). The rolling condition is then given by the 
first order ODE 


x =a, (12.3) 
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FIGURE 12.3 For the generally rolling wheel, the rolling conditions are 
non-integrable. 


where a is the radius of the cylinder. But this constraint can be integrated (without 
solving the problem!) to give 


x=ade, (12.4) 


on taking x = @ = 0 in the reference configuration. Thus the kinematical constraint 
(12.3) is equivalent to the geometrical constraint (12.4). This geometrical constraint 
can now be incorporated by selecting a new (reduced) set of generalised coordinates. 
In this example, only one generalised coordinate is finally required (either x or 6 ) so 
that the rolling cylinder has one degree of freedom. @ 


Example 12.6 A non-integrable kinematical constraint 


Figure 12.3 shows a circular disk of radius a which is constrained to roll on a hori- 
zontal floor with its plane vertical. Show that, in this problem, the rolling conditions 
are not integrable. 


Solution 


In the absence of the rolling condition, this system has four degrees of freedom. Let 
Oxyz bea fixed system of rectangular coordinates with O on the floor and Oz point- 
ing vertically upwards. Then a set of generalised coordinates is given by 


(i) the x and y coordinates of the centre C of the disk, 
(ii) the angle 6 between the plane of the disc and the x-axis, 


(iii) the angle @ that the disk has rotated about its axis (relative to some reference 
position). 


Now we impose the rolling condition, namely, that the contact particle should 
have zero velocity. In terms of the chosen coordinates, this gives 


x +adcosé =0, > +a¢sind = 0, 


a pair of first order ODEs. These equations cannot be integrated since 9 is an 
unknown function of the time and @ is absent from both equations. It follows that, 
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in this problem, the rolling conditions are not integrable and cannot be replaced by 
equivalent geometrical constraints. i 


Holonomic and non-holonomic systems 


Mechanical systems are classified according as to whether or not they have non-integrable 
kinematical constraints. 


Definition 12.3 Holonomic systems If a system has only geometrical or integrable 
kinematical constraints, then it is said to be holonomic. If it has non-integrable kinemat- 
ical constraints, then it is non-holonomic. 


Non-holonomic systems are the bad guys. In particular, non-holonomic systems do not 
satisfy Lagrange’s equations (as presented later in this chapter). It is beyond the scope of 
this book to proceed any further with the analytical mechanics of such systems and, from 
now on, we will deal only with holonomic systems. (A way of extending the Lagrange 
method to non-holonomic systems is described by Goldstein [4].) Such systems can still 
be treated by standard Newtonian methods however. The problem of the rolling wheel is 
solved in this way in Chapter 19. 


12.3 CONFIGURATION SPACE (g—space) 


Let S be a holonomic mechanical system with generalised coordinates 
q41i--->Gn- It is convenient to regard the list of values qg1,..., Gn as the coordinates of 
a ‘point’ q in a space of n dimensions, that is, 


q = (91, --++9n)- (12.5) 


Mathematicians call such a space E., (the Euclidean space of n dimensions), but we will 


denote it by Q (the space to which q belongs) and call it configuration space. Since 
the values of g1,..., Gn determine the configuration of the system S, it follows that the 
configuration of S is determined by the ‘position’ of the point q in configuration space, 
that is, 


rj =1r;(q) Ds eed gl) 


This abstract view becomes much clearer when applied to a particular example. Let S 
be the two-particle system shown in Figure 12.4. This system has two degrees of freedom 
and generalised coordinates x, 0. In this case the configuration space Q is the (x, @)-plane. 
Each point g = (x, @) lying in Q corresponds to a configuration of the mechanical system 
S. Moreover, as the configuration of the system changes with time, the point g moves 
through the configuration space as shown. 


Generalised velocities 


When the configuration of S changes with time, the point g moves through the configura- 
tion space QO so that g = q(t). This leads to the notion of generalised velocities. 
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FIGURE 12.4 The configuration of the system S is represented by the point qg = (x, @) in the 
configuration space Q. As the configuration of S changes with time, the point g moves on a path 
lying in the configuration space Q. 


Definition 12.4 Generalised velocities The time derivatives q\, ..., Qn of the gener- 
alised coordinates q, ..., Qn are called the generalised velocities of the system S. 


The n-dimensional vector (91, ..., Gn), formed from the {q;}, is just the time deriva- 
tive of the vector q, that is, 


@ = (Gis. xida)s (12.6) 


The vector g can be regarded as the ‘velocity’ of the point q as it moves through the 
configuration space Q. 


Particle velocities 


The values of q and q determine the position and velocity of every particle of the system 
S. For, since r; = r;(q) and q = q(t), it follows from the chain rule that 


or; or; . or; . 
v= Ay ese = qj- (12.7) 
0q1 Ogn " > 0qj / 
This expression for v; is linear in the variables g1,..., Gn with coefficients that depend 


on q. 
Example 12.7 Rule for finding particle velocities 


What is the connection between the formula (12.7) and the rule we have often used to 
find particle velocities? 


Solution 

Formula (12.7) says that v; is the vector sum of contributions, each one arising from 
the variation of a particular q;. This therefore justifies our rule for finding particle 
velocities. 
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Example 12.8 Finding the kinetic energy 


Find the particle velocities for the two-particle system shown in Figure 12.2, and 
deduce the formula for the kinetic energy. 


Solution 
The velocities of the particles P,, Pz are given by 


vy, = Xi, v2 =xXi+(acosdi +asin0k) 6. 
The kinetic energy of the system is therefore given by 


T = }m(v, +01) + 5m (v2 + v2) 
= sx? + 5m (i? + (a0)? + 2%(a6) cos 6) 


=mir+ (4ma’) 67 + (macos@) x6. 
Example 12.9 General form of the kinetic energy 


Show that the kinetic energy of any holonomic mechanical system has the form 


T=) >) aje(q) aja 


j=l k=1 
that is, a homogeneous quadratic form in the variables g1, ..., Gn, with coefficients 
depending on q. 
Solution 


Let P be a typical particle of S with position vector r and velocity v. Then 


or , or , = or , 
v= —Gi t+ + Gn = Gj 


Oq1 Oqn = 0qj 
and so 
(sas + in) (Soa + in) 
vevV= —gi se — . —q1 aoe — 
0q1 a Odn a Oq1 q adn a 
n ar n n 
=(S Hai) -( Ha) = Oo (Z-S) sate 

jal oqj ha 04k falta ” Ogk 

which is a homogeneous quadratic form in the variables g1, ..., gn, with coefficients 


depending on q. The kinetic energy of S is then given by 


n n 


N 
mae i (Vj +0;)= So ai @ 4/4: 


j=l k=1 
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where 


= Or; Or; 
1 i I 
aj =5 mij . . 
jk(qQ) 9] d, i (= =) 
It follows that T is also a homogeneous quadratic form in the variables g1, ..., Gn, 
with coefficients depending on q. Hl 


12.4 D’ALEMBERT’S PRINCIPLE 


For a holonomic system, we can overcome the problem that the position vectors 
{r;} are not independent variables by using generalised coordinates. We must now over- 
come the problem that the constraint forces are unknown. 
The Newton equations of motion for the general mechanical system S are 


mjd; = F> + FS (1<i<N), (12.8) 


where F : is the specified force and F e is the constraint force acting on the particle 
P;. The {F 3 } are known while the {F } are unknown. The trick is to construct linear 
combinations of the equations (12.8) so as to eliminate the {F rh 

Let a,(t), ao(t), ..., ay(t) be any vector functions of the time. Then, by taking 
the scalar product of equation (12.8) with a; and summing over 7, we obtain the scalar 
equation 


N 


N N 
Yi mii; -a; = )) FP ai + D0 FE + ay. (12.9) 
i=1 i=1 


i=1 


The question now is whether we can make 
N 
SFE +a; =0 (12.10) 
i=1 


by a cunning choice of the functions {a;}. More precisely, since S has n degrees of 
freedom, we need n linearly independent choices of the {a;} that make the equation (12.10) 
true. 

Actually, we already know one choice of the {a;} that makes the equation (12.10) 
true. Suppose that the total rate of working of the constraint forces is zero, which is true 
for many constraints (see Chapter 6). This condition can be written 


N 
Yo FE +0; =0, 
i=l 


where v; is the velocity of the particle P; at time t. Thus the condition (12.10) certainly 
holds for such a system if the {a;} are chosen to be the particle velocities {v;}. With 
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FC 


FIGURE 12.5 The particle P belonging to the 
system S is constrained to slide on the 
smooth fixed surface X. 


this choice, the {F a are eliminated from equation (12.9). The result of this operation is 
actually well known to us; it leads to the energy principle for the system! This is not quite 
what we are looking for, but it does suggest what the correct choices of the {a;} might be. 

Now comes the clever bit. For all the usual constraints that do no work, it is also true 
that the stronger condition 


N 
Yo FF - vf =0 (12.11) 


i= 
i=l 


holds, where the {v*} are any kinematically possible set of particle velocities at time t. 
The {v*} need not be the actual particle velocities at time ¢. For example, suppose a 
particle P of S is constrained to move on a smooth fixed surface X. Let v be the actual 
velocity of P as shown in Figure 12.5. Since & is a smooth surface, the constraint force 
FC that it exerts must be normal to X. Moreover, any kinematically possible motion of S 
at time ¢ gives P a velocity v* that is tangential to X. It follows that 


for any choice of v* that is kinematically possible. Although there is no theorem to this 
effect, a similar conclusion can be drawn for all the usual constraint forces that do no 
work. 

A set of velocities {v*} that is kinematically possible at time f is called a virtual 
motion of the system. The condition (12.11) is therefore equivalent to the statement that 
the total rate of working of the constraint forces is zero in all virtual motions, or, more 
briefly, that the constraint forces do no virtual work. We have therefore obtained the 
following result, known as d’ Alembert’s principle.* 


* After Jean le Rond d’Alembert (1717-1783). He was baptised Jean le Rond after being found aban- 
doned on the steps of a Paris church of that name. His principle was published in 1743 in his Traite de 
Dynamique. 
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D’Alembert’s principle 


If the constraint forces on a system do no virtual work, then 


N N 
Yi mii; - vf = FS - uF, (12.12) 
v= 


j= 


where {v*} is any virtual motion of the system at time f. 


Differential form of d’Alembert’s principle 
D’Alembert’s principle is often quoted in the equivalent differential form 


So mij -drj = > FS -drj;, 
i i 


where the {dr;} are any kinematically possible set of infinitesimal displacements of the particles {P;} at 


time ¢. This form is also known as the principle of virtual work. 


D’Alembert’s principle is not often applied directly, except in statical problems. We 
have obtained it because it leads to Lagrange’s equations. 


12.5 LAGRANGE’S EQUATIONS 


From now on, we will suppose that our mechanical system is holonomic and that 
its constraint forces do no virtual work. 


Definition 12.5 Standard system If a mechanical system is holonomic and its con- 
straint forces do no virtual work, we will call it a standard system. 


Consider then a standard mechanical system with n degrees of freedom and gener- 
alised coordinates g = (41, 92,.--,9n). Consider first the virtual motion {v*} generated 
by prescribing the generalised velocities at time f to be 


qa 1, qQ2 “= dn 0. 


From (12.7) it follows that the corresponding particle velocities are given by 


v® = Or; 
Oq1 


Cree a 


1 


Since we are assuming our system to be holonomic, these {v*} are a kinematically possi- 
ble set of velocities.* Furthermore, since we are assuming that the constraint forces do no 


* This is the only point in the derivation of Lagrange’s equations where it is essential that the system be 
holonomic. 
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virtual work, d’Alembert’s principle holds. It therefore follows that 


N 


Bg ie a uk Or; 
i ‘ 


8a1 


A similar argument holds when the {gq ;} are prescribed to be g) = 0, g2 = 1, 43 
Gn = 0, and so on. We thus obtain the system of equations 


N 


Or; = or; 
demidi ~=) °F) - - (j =1,...,n). (12.13) 
i=l i=l qi 


These are essentially Lagrange’s equations. It remains only to put them into a form that is 
easy to use. In fact, the left sides of equations (12.13) can be constructed simply from the 
kinetic energy of the system. The result is as follows: 


Suppose the holonomic system S has kinetic energy T (q, q). Then the left sides of equa- 
tions (12.13) can be written in the form 


Or; d ({0T oT 


: 0qj 0d; 0qj 


(1 < j <xn), where, for the purpose of calculating the partial derivatives, T is considered 
to be a function of the 2n independent variables q\,...,;4n, 1. -++Qn- 


Lagrange partial derivatives 

The partial derivatives of T that appear in equations (12.14) are peculiar to Lagrange’s equa- 
tions. In the expression for T, the coordinate velocities g1, ... Gn are considered to be inde- 
pendent variables in addition to the coordinates q1,...,@n. Consider, for example, the two 
particle system in Example 12.2. For this system 


T=mx?+ (4ma’) @? 4: (ma cos @) x0 


and this expression is considered to be a function of the four independent variables x, 6, x, 6 
(x is absent). The Lagrange partial derivatives of T are therefore 
oT oT 


oT . Or . 
Pe 0, at = 2mx+(ma cos 6)6, ry = —(ma sin@)x6, ry = ma?6-+(ma cos 0)x. 


The proof of the formula (12.14) is straightforward (once Lagrange had found the 
answer!) but a bit messy because of the many suffices. 


Proof of the formula (12.14) 
Since 
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it follows that 


The last step follows since, in the formula for v;, q and q are regarded as independent vari- 


ables*. Then 
dfay, _ ar; d (ar; 
—_= = (50; - vi) =UvU;°- + UV; ° 
dt 0qj 0qj dt 0qj 


. Or; ke a°r; 
op a a 


0qj fay 99k 99j 


after a further application of the chain rule. In a similar way, 
n 


a (: ) dv; a ari. “re, 
— (50; °0;) =0;- = vj + —— dk) =; ° dks 
gj 75 ' dqj | 8; d gk 2 qj 99k 


where, in the formula for 0; /q;, we have regarded q and q as independent variables. Com- 
bining these two results gives 


d{ oa 0 . Or; 
& [ae (0) ]- (dum) <6 
dt | dq; 0qj 0qj 


If we now multiply by m; and sum over i we obtain 


oar ef ar; 
ir ) ear 


0qj 0qj 


which is the required result. Ml 


For general specified forces {F;} there is no simplification for the right sides of the 
equations (12.14), but we do give them names: 


Definition 12.6 Generalised force The quantity Q ;, defined by 


Or; 
Nae De ares 
i 


0qj 


is called the generalised force corresponding to the coordinate qj. 


We have therefore proved that: 


* The formula 
Or; _ Ori 
dqj qj 


is sometimes facetiously referred to as ‘cancelling the dots’. Only mathematicians find this amusing. 


338 Chapter 12 Lagrange’s equations and conservation principles 


Lagrange’s equations for a general standard system 


Let S be a standard system with generalised coordinates q, kinetic energy T (q, q) 
and generalised forces {Q ;}. Then, in any motion of S, the coordinates q(t) must 
satisfy the system of equations 


al ff Ost or 
= ; l=) SO). 2a 
di ial fp ee es 


This is the form of Lagrange’s equations that applies to any standard system. 


Conservative systems 


When the standard system is also conservative, the generalised forces {Q ;} can be written 
in terms of the potential energy V (q) as 


Q; =-—. (12.16) 


[This result is simply a generalisation of the formula F = — grad V .] 


Proof of the formula (12.16) 
Let q“, q® be any two points of configuration space that can be joined by a straight line 


parallel to the g;-axis. Then 


q’ q’ Or; 
is — qs ;  0qj ’ 7 vG 
B 


q 
=vay)-va®)=- f want 
q’ °j 


This equality holds for all g4, q? chosen as described, which implies that the two inte- 
grands must be equal. Hence 


as required. i 


We have therefore proved that: 
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Lagrange’s equations for a conservative standard system 


Let S be a conservative standard system with generalised coordinates q, kinetic 
energy T (q,q) and potential energy V(q). Then, in any motion of S, the coordi- 
nates q(t) must satisfy the system of equations 


d ( 0T oT aV 
= l<=7j SM). 217 
ze ) (l<j<n) (12.17) 


These are Lagrange’s equations for a conservative standard system. This is by far 
the most important case; most of analytical mechanics deals with conservative systems. 
It is remarkable that all one needs to obtain the equations of motion for a conservative 
system are the expressions for the kinetic and potential energies. 


Sufficiency of the Lagrange equations 

We have shown that if S is a conservative standard system then Lagrange’s equations (12.17) 
must hold. Thus Lagrange’s equations are necessary conditions for g(t) to be a motion of S. 
It does not seem possible to reverse this argument to show that the Lagrange equations are 
also sufficient. (Where does the reverse argument break down?) However, from the general 
theory of ODEs, we are assured that there is a unique solution of the Lagrange equations 
corresponding to each set of initial values for g, g. Thus the Lagrange equations actually are 
sufficient to determine the motion of S. 


The Lagrange method for finding the equations of motion of a conservative system is 
summarised below: 


Lagrange’s method for conservative systems 


Confirm that the system is standard and that the specified forces are conservative. 


Select generalised coordinates. 


Evaluate the expressions for T and V in terms of the chosen coordinates*. 


Substitute these expressions into the Lagrange equations (12.17) and turn the 
handle. It’s a piece of cake! 


Example 12.10 Using Lagrange’s equations: | 


Consider a block of mass m sliding on a smooth wedge of mass M and angle a which 
itself slides on a smooth horizontal floor, as shown in Figure 12.6. The whole motion 


* See Chapter 9 for the details of how to find T. 
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Xx 


FIGURE 12.6 The block slides on the smooth surface of the wedge which slides 
on a smooth horizontal floor. 


is planar. Find Lagrange’s equations for this system and deduce (i) the acceleration 
of the wedge, and (ii) the acceleration of the block relative to the wedge. 


Solution 


This is a standard conservative system with two degrees of freedom. Take as gener- 
alised coordinates x, the displacement of the wedge from a fixed point on the floor, 
and y, the displacement of the block from a fixed point on the wedge. The calcula- 
tion of the kinetic and potential energies in terms of x, y is performed exactly as in 
Chapter 9 and gives 


T = 4Mi?+34m (= +7 + 2%} cosa) , 
V = —mgy sina. 


The required partial derivatives of T and V are then given by 


oT oT : ; oV 

— = 0, — =(M+m)x+(mcosa)y, —= 

ox ox ox 

oT oT : . oV . 
—=0, — =(mcosa)x +my, — = —mg sina. 
dy ay dy 


We can now form up the Lagrange equations. The equation corresponding to 
the coordinate x is 


d 

az CM mda + em cosa)y]—0 = 0, (12.18) 
and the equation corresponding to the coordinate y is 

d 

a cosa)x +my]—0O = mgsina. (12.19) 


If we now perform the time derivatives in equations (12.18), (12.19) and solve 
for the unknowns x, ¥ we obtain 


mg sina cos a .  (M+m)g sina 
M +msin* 7 2 


’ 5 ? 
a M+msin~ a 


which are the required accelerations. They are both constant. 
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O (b—a)6 


FIGURE 12.7 The small solid cylinder rolls on the 
inside surface of the large fixed cylinder. 


These results can of course be obtained by more elementary means. For instance we 
could solve this problem by appealing to conservation of horizontal linear momentum and 
energy. However the Lagrange method does have the advantage that less physical insight 
is needed to solve the problem. If the system is a standard one and T and V can be 
calculated, then turning the handle produces the equations of motion. 


Example 12.11 Using Lagrange’s equations: Il 


Figure 12.7 shows a solid cylinder with centre G and radius a rolling on the rough 
inside surface of a fixed cylinder with centre O and radius b > a. Find the Lagrange 
equation of motion and deduce the period of small oscillations about the equilibrium 
position. 


Solution 


If the cylinder were not obliged to roll, the system would have two degrees of freedom 
with generalised coordinates 6 (the angle between OG and the downward vertical) 
and @ (the rotation angle of the cylinder measured from some reference position). 
The rolling condition imposes the kinematical constraint 


(b —a)6 — ad = 0. 
This constraint is integrable and is equivalent to the geometrical constraint 
(b—a)0 —ahb=0 


on taking ¢ = 0 when 6 = 0. Thus the rolling cylinder is a standard conservative 
system with one degree of freedom. 
Take 6 as the generalised coordinate. Then the kinetic energy is given by 


i— 5m ((b _ ab) + 5 (5ma’) b 
2 
= sm ((b _ ab) + 5 (3ma’) = *) 6? 


= 3m(b - a)°6" 


342 Chapter 12 Lagrange’s equations and conservation principles 


FIGURE 12.8 The system moves under the prescribed force F(t). 


and the potential energy by 
V = —mg(b —a)cosé. 


There is only one Lagrange equation, namely 


d a. . 
< | ime a) é| 0 = —mg(b—a)siné 


which simplifies to give 


2g 


—— sind = 0. 
*Sp2n 


Interestingly, this equation is identical to the exact equation for the oscillations of a 
simple pendulum of length 3(6 — a)/2 as obtained in Chapter 6. 


The linearised equation governing small oscillations of the cylinder about 6 = 0 
is 


2g 
= 
. 3(b — a) 


so that the period t of small oscillations is given by 


1/2 
=" (== a i: 
2g 


Example 12.12 Using Lagrange’s equations: III 


Let S be the system shown in Figure 12.8. The rail is smooth and the prescribed force 
F(t) acts on the particle P2 as shown. Gravity is absent. Find the Lagrange equations 
for S. 


Solution 


S is a standard system with two degrees of freedom. The new feature is the prescribed 
external force F(t) acting on P2. This time dependent force cannot be represented by 
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a potential energy and so the generalised forces {Q ;} must be evaluated direct from 
the definition (12.16). 

Take generalised coordinates x, 0 as shown and let the corresponding generalised 
forces be called Q,, Qg. Then, since S has just two particles, 


On = Ff. ha wf. 2, 

Qo = Ff. + rf. 2, 
where 

FS =0, F} =F()i, 
and 


rj =Xi, ro = (x + asin@)i — (acos@)k. 
The generalised forces Q,, Qg are therefore given by 
0, =0+ (F()i)-t= FO) 
and 
Og = 0+ (F(t)i) + (acosOi +asindk) = (acos@) F(t). 
The kinetic energy is given by 
T =mi?+ (ma cos 0)x6 + sme? 


and so the Lagrange equations are 

d : i 

r [2mx + (ma cos6)6] = F(t), 

d ‘ ; 

FY [(ma cos0)x + mo | - [—(ma sin 0)x0 | = (acos@)F (t). @ 


Question Incorporating extra forces 


How would you incorporate gravity into the last example? 


Answer 


Since the expression (12.16) for the {Q ;} is linear in the {Fj}, the extra forces are 
incorporated by just adding in their respective contributions to the {Q ;}. Thus when 
gravity is present in the last example, Q,, Og become 


QO, = F(t)+0, Qo = (acos@)F (t) — mga sin@. & 


Many more examples of the use of Lagrange’s equations are given in the problems at 
the end of the chapter. 
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FIGURE 12.9 The pendulum with a moving 
support. 


12.6 SYSTEMS WITH MOVING CONSTRAINTS 


The theory of Lagrange’s equations can be extended to include a fascinating class 
of problems in which the constraints are time dependent. Consider the system shown in 
Figure 12.9 which is a simple pendulum in which the support point A is made to move 
vertically so that its downward displacement from the fixed origin O at time ¢ is some spec- 
ified function Z(t). For example it could be made to oscillate so that Z(t) = Zo cos pt. 
With this constraint, the coordinate @ is no longer sufficient to specify the position of the 
particle P. In fact, relative to the origin O, the position vector of P at time f is given by 


r = (asin@)i+ (Z(t) +acosé@)k, 


so that r is a function of 6 and ¢, not just 6. Constraints which cause the {r;} to depend 
on q and ¢ (and not just q) are called time dependent constraints, or simply moving 
constraints. Systems that have moving constraints include: 


Systems with moving constraints 


e Systems in which particles are forced to move in a prescribed manner. 


e Systems in which particles are forced to remain on boundaries that move in a 
prescribed manner. 


e Systems in which the motion is viewed from a frame of reference that is acceler- 
ating or rotating in a prescribed manner. 


e Systems in which beetles, mice (or lions!) move around in a prescribed manner. 
(These creatures are highly trained!) 


We assume our systems are such that they would be standard if the constraints were 
fixed. We will refer to such systems as standard systems with moving constraints. 
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Kinematics of systems with moving constraints 


The configuration {7;} of a system with moving constraints is specified by 
rj =ri(q.t) (<i <N). (12.20) 


Here, the time ¢ has the réle of an ‘additional coordinate’. However, it is not a true coor- 
dinate and we will still regard the system as being holonomic with n degrees of freedom. 
The corresponding particle velocities are given by 


ori. Ori. Or; 
“Gi tees + Gn + —. (12.21) 


vj = 
0q1 Oqn 


This expression is still a linear form in the variables {q ;} but it is not homogeneous; there 
is now a ‘constant’ term (a function of q and f). 


Question Form of the kinetic energy 
What is the form of the kinetic energy when moving constraints are present? 


Answer 
It follows from the above expression for the particle velocities that T has the form 


T(9,4,t)= > Do ajea. 4dr + D> bjG.04; +9, 0), 


j=l k=1 j=! 


which is still a quadratic form in the variables {gq ;}, but it is not homogeneous; there 
are now linear terms and a constant term. 


Energy not conserved with moving constraints 


Another feature of systems with moving constraints is that the constraint forces do work. 
This is quite obvious from the driven pendulum example above. The constraint force that 
causes the specified displacement of the support point A will generally have a vertical 
component and, since A is moving vertically, this force will do work. So, even when the 
specified forces are conservative (as gravity is in the driven pendulum example), the total 
energy T + V is not a constant because the constraint force does work. Hence, systems 
with moving constraints are generally not conservative. 


Lagrange’s equations with moving constraints 


There are good reasons to expect that systems with moving constraints do not satisfy 
Lagrange’s equations. In general, constraint forces that enforce moving constraints do 
work. Since virtual motions include the special case of real motion, surely such con- 
straints must also do virtual work; then d’Alemberts principle and Lagrange’s equations 
will not hold. Compelling though this argument seems, it is false. Systems with moving 
constraints do satisfy Lagrange’s equations! To see why this is so, one must identify the 
crucial steps in the derivation of Lagrange’s equations. There are actually only three: 
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e Are the equations 4 FC - (dr; /dq;) = O still true? 


This question could be posed in the form ‘do the constraint forces do virtual 
work?’ and we have presented a plausible argument that they do. Consider 
however the meaning of the partial derivatives dr;/dqj;. Since we now have 
rj = rj(q,t), 0r;/dq, means the derivative of r; with respect to gq, keeping 
2, Y3x++> Gn and the time t constant. Thus these derivatives are calculated at 
constant ¢. It follows that the virtual motion defined by the {d7; /dq,} is kinemati- 
cally consistent with constraints that are fixed at time f, not with the actual moving 
constraints. Hence, since ar F . - (dr; /0g;) would be zero if the constraints 
were fixed, it is still zero when the constraints are moving!* 


Is the formula (12.14) still true? 

The point here is that the formula (12.21) for the particle velocities now has the 
extra term dr; /0t which might upset the formula (12.14). However, it does not. 
The proof of this is left as an exercise. 


When the specified forces are conservative, is the formula ae | F}-(0ri/dqj) = 
dV /dq; still true? 

The potential energy V is a function of the configuration of the system, but, since 
the configuration is now specified by g and t, V = V (qg,t). However, since the 
partial derivatives dr; /dq; and dV /dq; are evaluated at constant f, the proof of 
the formula is unchanged. 


We have therefore obtained the following result: 


Lagrange’s equations still hold when moving constraints are present provided that, 
in the expressions for T and V, the time ¢ is regarded as an independent variable. 


Lagrange’s equations with moving constraints 


Example 12.13 Pendulum with an oscillating support 


Find the Lagrange equation for the driven pendulum for the case in which the dis- 
placement function Z(t) = Zocos pt. [Assume that the ‘string’ is a light rigid rod 
that cannot go slack. ] 


Solution 
This system has one degree of freedom and a moving constraint at A. Take @ as the 
generalised coordinate. It follows from Figure 12.9 that the kinetic energy T is given 


by 


T= sm (06? + Z* = 2907 sin@) 


* The situation is sometimes loosely expressed by the mysterious statement that ‘moving constraints do 
real work but no virtual work . 
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FIGURE 12.10 Motions of the driven pendulum. Top: p/Q = 1.1, Zo/a 
= 0.2 and 6) = 0.1. Botton: p/Q = 1.9, Zo/a = 0.2 and @ = 0.1. 


and the potential energy V by 
V =-mg(Z+acos@). 


The required partial derivatives are therefore 


oT a: — oT me OV . 
—~ =m (a 6 —aZsiné), — =-—maéZcosd, —~=megasine. 
00 00 00 


The Lagrange equation corresponding to the coordinate 6 is therefore 
d He a ; 
ae (a0 + Z sin 0) — maéZ cos? = —mgasiné, 
which simplifies to give 
6+ (2° —a7'Z) sind =0, 


where Q? = g/a. Hence, for the case in which Z = Zo cos pt, the Lagrange 
equation is 


a 2 Zop . _ 
6 + { Q* + — cos pt }) sind = 0.8 (12.22) 
a 


Question Motions of the driven pendulum 
What do the pendulum motions look like? 
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Answer 


The equation (12.22) has some fascinating solutions, but they can only be found 
numerically. First we will reduce the number of parameters by putting the equa- 
tion in dimensionless form. If we define the dimensionless time t by t = pt, then 
the equation becomes 


d°0 ri Oo , (2 ied 
cost }) sind = 0. 
dt? p2 a 


We can now see that the solutions depend on the dimensionless driving frequency 
p/2 and the dimensionless driving amplitude Zq/a. 

One interesting question is whether the small oscillations of the pendulum about 
6 = O are destabilised by the motion of the support. The answer is that it depends 
on the dimensionless parameters p/Q and Zo/a in a complicated way. Figure 12.10 
shows results obtained by numerical solution of the equation with initial conditions 
of the form 6 = 0, 9 = 0 when t = 0. The top graph shows the motion for the case 
p/8 = 1.1, Zo/a = 0.2 and 6) = 0.1. In this graph, 0/00 is plotted against t/27 
(the number of oscillations of the support). The motion turns out to be stable with 
the amplitude of the oscillations remaining close to their initial value. The bottom 
graph shows the motion for the case p/Q = 1.9, Zo/a = 0.2 and 6 = 0.1. In this 
graph, 0/2z (the number of revolutions of the pendulum) is plotted against t/27 (the 
number of oscillations of the support). This motion turns out to be unstable. The 
amplitude of the oscillations grows until the pendulum performs complete circles; 
it then stops and goes the opposite way. Numerical results suggest that this chaotic 
motion continues indefinitely. m™ 


THE LAGRANGIAN 


The Lagrange equations of motion (12.17) for a conservative standard system are 


expressed in terms of the kinetic energy T = T (q, q) and the potential energy V = V(q). 


They can however be written in terms of the single function T — V. Since dV /dq; = 0, 


the equations can be written 


that is, 


d ( oT oT d (0V aV . 
= Cara); 
dt 0qj 0qj dt 0qj 0qj 
d (o0L OL 0 dejen 
= n j 
dt \0qj; 0qj ad = 


where L(q,q) = T(q,q) — V(q) is called the Lagrangian of the system. The same 


operation can be applied to systems with moving constraints whose specified forces are 


conservative. The only difference is that, in this case, L = L(q, q,t). 
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Writing the Lagrange equations in this form makes no difference whatever to problem 
solving. However, any system of equations that can be written in this way has special 
properties. In particular, it is equivalent to a stationary principle (see Chapter 13), and 
can also be written in Hamiltonian form (see Chapter 14). This is the form most suitable 
for advanced developments and for making the transition to quantum mechanics. There 
is therefore a strong interest in any physical system whose equations can be written in 
Lagrangian form. 


Definition 12.7 Lagrangian form If the equations of motion of a holonomic system 
with generalised coordinates q can be written in the form 


d (OL OL : 
=0 (sa 7 =n); (12.23) 
dt 0qj 


for some function L = L(q,4q,t), then L is called the Lagrangian of the system and the 
equations are said to have Lagrangian form. 


For example, the Lagrangian for the driven pendulum is 
L(G, 6, th= 5m (a°6? + 7 —2a07 siné) +mg(Z+acos@), 


where Z = Z(t) is the displacement of the support point. 


Velocity dependent potential 


There are systems whose specified forces are not conservative (so that V does not exist), 
but their equations of motion can still be written in Lagrangian form. Any standard system 
with generalised forces {Q ;} satisfies the Lagrange equations (12.15). If it happens that 
the generalised forces can be written in the form 


0 d (au aU (<j <n) (12.24) 
= n), i 
‘ dt \ aq j oq j =e 

for some function U(q,q,t), then clearly the equations (12.15) can be written in 
Lagrangian form by taking 


L(q, q. t) = T(q, q. t) _ U(q, q.t). 


The function U (q, q, t) is called the velocity dependent potential of the system. 

This seems to be a mathematical artifice that has no importance in practice. It is true 
that there is only one important case in which a velocity dependent potential exists, but 
that case is very important; it is the case of a charged particle moving in electromagnetic 
fields. The following example proves this to be so for static fields. The corresponding 
result for electrodynamic fields is the subject of Problem 12.15. 
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Example 12.14 Charged particle in static EM fields 


A particle P of mass m and charge e can move freely in the static electric field E = 
E(r) and the static magnetic field B = B(r). The electric and magnetic fields exert 
a force on P given by the Lorentz force formula 


F=eE+evxB, 


where v is the velocity of P. Show that this force can be represented by a velocity 
dependent potential U(r, r) and find the Lagrangian of the system. 


Solution 
In the static case, Maxwell’s equations for the electromagnetic field reduce to 


div D= p=, curl E = 0, curl H = j, divB =0. 


In particular, the equation curl E = 0 implies that E(r) is a conservative field and 
can be written in the form 


E = — grad@ 


where ¢ = ¢(r) is the electrostatic potential. The equation div B = 0 implies that 
B(r) can be written in the form 


B=curlA, 


where A = A(r) is the magnetic vector potential.” 

Take the generalised coordinates to be the Cartesian coordinates x, y z of the 
particle and, from now on, let r mean (x, y, z). What we are looking for is a velocity 
dependent potential U(r, r) that yields the correct generalised forces when substi- 
tuted into the equations (12.24). In the present case the generalised forces Q,, Oy, 
Q, are simply the x- y- and z-components of the Lorentz force F’. The electric part 
of the force, eE is easily dealt with since it is conservative and can be represented by 
the ordinary potential energy V = e@(r). It is the magnetic part of the force that 
needs the velocity dependent potential. One wonders how anyone found the correct 
U, but they did, and it turns out to be —e r+ A(r). All we need to do is to check that 
this potential is correct. 

Consider therefore the potential 


U=r-A(r) =£A,(r) + yAy(*) + 2A,(7). 


* The potential @ is unique to within an added constant, but, for any fixed B, there are many possibilities 
for A. Adding the grad of any scalar function to A does not change the value of B. This ambiguity in 
A makes no difference in the present context; any choice of A such that B = curl A will do. The actual 
determination of vector potentials is described in textbooks on vector field theory (see Schey [11]) for 
example. 
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Then 


and so 


d (au 0U d _0Ax 0A OA: 
( ) (Ay) -x—— -y— 


dt \ ox 0 dt ox Ox Ox 
dAx.  OAy.  OAx., .0A, dA, dA; 
x yt Z x y= Zz 
Ox dy Oz ox ox Ox 


_(dAy dA, _ (0A, OA; 
il. +2(— = 
Ox oy Oz Ox 
= —y (curl A), + Z (curl A), = —yB, + zBy 
=— (7 x B) : 


x 


When multiplied by —e this is Q, for the magnetic part of the force. The values of 
Qy and Q-, are confirmed in the same way. 

We have therefore proved that the Lorentz force is derivable from the velocity 
dependent potential 


U =eo(r)—er- A(r) (12.25) 


and the Lagrangian of the particle is therefore 


L=jmr-r—edg(r) +er- A(r) (12.26) 


Question Why bother? 
Why should we find the Lagrangian for this system when we already know that the 
equation of motion is 


dv 
m—=eE+evxB? 
dt 
Answer 
The interest in this Lagrangian is that, from it, one can find the Hamiltonian, and this 
is what is needed to formulate the corresponding problem in quantum mechanics. 
This problem has important applications to the spectra of atoms in magnetic fields. Ml 


12.8 THE ENERGY FUNCTION / 


Let S be any holonomic mechanical system with Lagrangian L (q, q, t). Then the 
equations of motion for S take the form (12.23). On multiplying the j-th equation by g; 
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and summing over j we obtain 


“Td (aL OL). 
os dt \da;)  aq;| 7 
= qj qj 


Note that dL /dt means the partial derivative of L(q,q,t) with respect to its final argu- 
ment, holding g and g constant. Thus 


— =0 (12.27) 


where 


c=), (5-4) —L (12.28) 


Definition 12.8 Energy function The function h defined by equation (12.28) is called 
the energy function of the system S. 

The energy function h is a generalisation of the notion of energy. For conservative 
systems, we will show that it is identical with the total energy E = T + V. However, for 
non-conservative systems, V may not exist and, even if it does, / and EF are not generally 
equal. There are three typical cases: 


Case A If L = L(q, q,t), then OL /ot 4 0 and A is not conserved. 


Example 12.15 / for the driven pendulum 


Find the energy function / for the driven pendulum problem. 


Solution 
In the driven pendulum problem, 


m (a°6? 4.77 = 2407 sin@) +mg(Z+acosé), 
and so 


h=6 36 —-L= sm («°6? _ Zz) —mg(Z +acosé). 
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This is not the same as the total energy 
T+V= 5m (a°6? ug? Qae7, sind) —meg(Z +acosé), 


and neither quantity is conserved. 


Case B If L = L(q,q) then dL/dt = O so that h is a constant. The conservation 
formula 


n 
(5-4) — L = constant (12.29) 


is called the energy integral of the system S. 
Systems for which L = L(q,q) are said to be autonomous. The above result can 
therefore be expressed in the form: 


Autonomous systems conserve /1 


In any motion of an autonomous system, the energy function h(q, q) is conserved. 


Example 12.16 A charge moving in a magnetic field 


Find the energy integral for a particle of mass m and charge e moving in the static 
magnetic field B(r). 


Solution 
For this problem 


L=jmr-r+er-A(r), 


where A is the magnetic vector potential. Since dL /dt = 0, the energy integral exists 
and has the form 


Roa Fe : L=h, 
where / is a constant. On using the formula for L, this becomes 
m (2+ 5° +2) ter A- 1m (+5742) er A=h, 
that is 


pm (7 +5? +2?) =h. 
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This is the required energy integral. In this case, the constant / is the kinetic energy 


of the particle. 


This result is well known. When a charged particle moves in a magnetic field, 
the force is perpendicular to the velocity of the charge. Thus no work is done by the 
magnetic field and so the kinetic energy of the particle is conserved. For this system, 
V does not exist since the force exerted by the magnetic field is velocity dependent; 


the total energy E is therefore not defined. 


Case C If S is a conservative standard system, then S is autonomous and so h is 
conserved. In addition, the energy integral can be written in a more familiar form. In this 


case, L = T — V, where T has the form 


n n 


T= S050 aja) 44 


j=lk=l 
(see Example 12.9), and V = V(q). Hence 


aL sOaT . 
— == OS 2 Ajk(Q) Ik 
0qj 0qj dX 


and so 


n 
j=l 7 j=lk=l 
The energy integral therefore becomes 
2T — (T — V) =constant, 


that is 


T + V =constant 


OL n n 
2254, =2) > agg) aoe = oF 


(12.30) 


which is the classical form of conservation of energy. In this case, the constant is the 


total energy E of the system. 


12.9 GENERALISED MOMENTA 


The generalised momenta of a mechanical system are defined in a different way 


to conventional linear and angular momentum. 


Definition 12.9 Generalised momenta Consider a holonomic mechanical system 


with Lagrangian L = L(q, q, t). Then the scalar quantity pj; defined by 


_ aL 


Pi 9g; 
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is called the generalised momentum corresponding to the coordinate qj. It is also called 
the momentum conjugate to qj. 


Example 12.17 Finding generalised momenta 
Consider the problem in Example 12.10 whose Lagrangian is 

| — 5Mx? + sm (:? + yr + 2xy cos) +mgy sina. 
Find the generalised momenta. 


Solution 
With this Lagrangian, the momenta p, and py are given by 


oL ; sy 
age ee COE): 
x 


L 
Py =~ =m(y+xcosa). mf 
dy 
Generalised momenta are often recognisable as components of linear or angular 
momentum of the system. In the above example, p, is the horizontal component of the 


linear momentum of S, but py is not a component of linear momentum. 
Conservation of generalised momenta 
In terms of the generalised momentum p;, the j-th Lagrange equation can be written 
Fe 
dt dq; 
It follows that if dL /dq ; = 0 (that is, if the coordinate q ; is absent from the Lagrangian), 


then the generalised momentum pj; is constant in any motion. Such ‘absent’ coordinates 
are said to be cyclic. We have therefore shown that: 


Conservation of momentum 


If qj is a cyclic coordinate (in the sense that it does not appear in the Lagrangian), 
then p;, the generalised momentum conjugate to g;, is constant in any motion. 


In the last example, the coordinate x is cyclic but y is not. It follows that p, is 
conserved but py is not. 


Example 12.18 A cyclic coordinate for the spherical pendulum 


Consider the spherical pendulum shown in Figure 11.7. The Lagrangian L is given 
by 


L= sma” [6 + (sin@ oy] +mgacos6, 
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where 0, @ are the polar angles shown. Verify that @ is a cyclic coordinate and find 
the corresponding conserved momentum. 


Solution 
Since dL /d¢ = 0, the coordinate ¢ is cyclic. It follows that the conjugate momentum 
P¢ is conserved, where 
OL Sct 3 
= — =ma’‘ sin’ 0 ¢. 
Po ag 
This generalised momentum is actually the angular momentum of the pendulum about 
the polar axis. ml 


12.10 SYMMETRY AND CONSERVATION PRINCIPLES 


The existence of a cyclic coordinate is not the only reason why a generalised 
momentum (or momentum-like quantity) may be conserved. Indeed, whether a cyclic 
coordinate is present depends not only on the system, but also on which coordinates are 
chosen; if the ‘wrong’ coordinates are chosen then the conserved quantity will be missed. 
The existence of conserved quantities of the form F(q, q) is in fact closely linked with 
symmetries of the system. We illustrate this by the following two results, which are the 
most important of such cases. 


Theorem 12.1 Invariance of V under translation Let S be a conservative standard 
system with potential energy V. Then if S can be translated (as if rigid) parallel to a 
constant vector n without violating any constraints, and if V is unchanged by this trans- 
lation, then, in any motion of S, the component of linear momentum in the n-direction is 
conserved. 


Proof. Let {r;} be any configuration of S and let the corresponding point in configuration 
space be q. Then a (rigid) displacement A in the n-direction will have the effect 


rj; => ee 
where 
re =r; +An. 


Since this displacement is consistent with the system constraints, {r*} is also a configuration 
of S and corresponds to some point q* in configuration space. Thus, in configuration space, 
the displacement has the effect 


q->q’, 
where 
re = 1r;(q*). 


Note that A = 0 corresponds to the undisplaced state so that r+ = r; and q* = q when A = 0. 
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Suppose now that q(t) is a motion of S under the potential V(qg). Then gq satisfies 
Lagrange’s equations which we choose to take in the form 


and summing over j we obtain 


(ari | 897 "av | aq; 
Dome 2 aa | on. ~ S| aa” 
i A=0 A=0 


mice 


by the chain rule. Furthermore 


ar’ 


i 


On 


“55q") 
Sp = 
a 
since - = ri(q) + An in the given displacement. 
In the same way, 


tay lag: : F aq’ d 
ae =] -¥|(2v@) a = ava] =0, 
=e 40 j= 1j 1=0 4=0 


since V is unchanged by the displacement, that is, V (q*) = V (q). On combining these results 
together, we obtain 
>» mM; 0; -n=0. 
i 


Finally, since n is a constant vector, we may integrate with respect to f to obtain 


(Sm) ‘n=, 


i 


where C is a constant. Thus the component of linear momentum in the n-direction is con- 
served. 


For example, this theorem applies to the system shown in Figure 12.6 (the wedge and 
block). This system can be translated in the x-direction without violating any constraints, 
and this translation leaves the potential energy unchanged. The conserved quantity is the 
component of linear momentum in the x-direction. 


358 Chapter 12 Lagrange’s equations and conservation principles 


Theorem 12.2 Invariance of V under rotation Let S be a conservative standard sys- 
tem with potential energy V. Then if S can be rotated (as if rigid) about the fixed axis 
{O, k} without violating any constraints, and if V is unchanged by this rotation, then, in 
any motion of S, the angular momentum about the axis {O , k} is conserved. 

Proof. | The proof closely follows that in the last theorem. Let A be the angle turned in a 


rotation of S about the fixed axis {O, k}, where O is also the origin of position vectors. Then 
by following the same steps, we obtain, as before, 


. 0 ee 
> Mm; Vj * = 0. 
or 
I A=0 
This time the A-derivative means the rate of change with respect to the rotation angle A so that 
ore 
L=kxr?. 

Or 


Since r* = r; when A = 0, it follows that 


So midi : (kxrj) = 0. 


i 


Finally, since k is a constant vector, we may integrate with respect to ¢ to obtain 


(Xmaxs! ° k=C, 


i 


where C is a constant. Thus the angular momentum about the axis {O, k} is conserved. 


For example, this theorem applies to the spherical pendulum. The pendulum can be 
rotated about the axis {O, k} (where O is the support and k points vertically upwards) 
without violating any constraints, and this rotation leaves the potential energy unchanged. 
The conserved quantity is the angular momentum of the pendulum about the vertical axis 
through O. 


These two theorems are powerful tools for identifying conserved components of lin- 
ear or angular momentum even when the system is very complex. For example, any con- 
servative standard system whose potential energy is invariant under a// translations and 
rotations conserves all three components of linear and angular momentum, as well as the 
total energy, making seven conserved quantities in all. 


Noether’s theorem 


The two theorems above are particular instances of an abstract result known as Noether’s 
theorem.” In each of the above cases, there is a one-parameter family of mappings {Mt}, 


* After the German mathematician Emmy Amalie Noether (1882-1935). Despite the obstacles placed in 
the way of women academics at the time, she made fundamental contributions to pure mathematics in 
the areas of invariance theory and abstract algebra. The result now known as Noether’s theorem was 
published in 1918. 
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parametrised by a real variable 4, that act on the configuration space Q, that is, 


Dares 
q— q’. (12.32) 


In each case A = O corresponds to the identity mapping (that is, gq — q), and in each case 
the potential energy V (q) is invariant under {Dt*}, that is, 


V(q*) =V@q). 


From these facts, we were able to prove that, in each case, a certain momentum component 
was a constant of the motion. 

This idea was generalised by Noether to apply to any Lagrangian system and any 
family of mappings {9)t*}, provided that the Lagrangian L is invariant under {9t*} in 
the sense that 


L@.g.o=L@dd 


for all 2. In this formula, q* is a known function of the variables q and A (as defined by the 
mapping 9Jt*), but we have not yet said what we mean by le This is however defined in 
the following commonsense way: let A be fixed and let q* be the image point of a typical 
point q. Suppose now that the point q has velocity q in the configuration space Q. This 
motion of q imparts a velocity to the image point q*, and it is this velocity that we call 
q’. This definition is expressed by the formula 


; oq”. 
=> 4G; (12.33) 


from which we see that q’ is a known function of the variables qg, g and 4. 
The formal statement and proof of Noether’s theorem are as follows: 


Theorem 12.3 Noether’s theorem Let S be a holonomic mechanical system with 
Lagrangian L(q, q,t) and let {9Nt*} be a one-parameter family of mappings that have 
the action 


d 
Mt : 
q — 1 (12.34) 
where q* = q when d = 0. If the mappings {Mt*} leave L invariant in the sense that 


L@q’,4@’,t) =L@,4@,t) (12.35) 


for all x, then the quantity 


n dq* 
» Pj Ea (12.36) 
A= 
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is conserved in any motion of S. [Note that the conserved quantity is not generally one of 
the momenta {p;} but a linear combination of all of them with coefficients depending on 
q:] 

Proof. Let q(t) be any physical motion of the system S, that is, a solution of the Lagrange 
equations 


i L 
e i = 0 d = J < n), 
dt \dqj/ qj 


where L(q, q, ¢) is the Lagrangian of the system S. Now consider the expression 
d (_ | 9q; _ d {aL | oq; 
dt \P/| aa ~ at \ aq; | Oa 
A=0. A=0. 
d (=) Z| aL (; | 
Se a SaaS a | = 
dt \0qj; on _ dqj \dt | 0a i 
_ OL Ea , OL (5 Ea 
0qj Ox =5 0qj dt i 
on using the j-th Lagrange equation. Now, by the chain rule, 
d [8q° ta fog? a (.aq* aq’ 
Tae =a mae c= — yak meas 
dt \ aa = Oqgx \ OA OA ear Odk Or 
by the definition (12.33) of a It follows that 


+ (>, [2] aL | _ ob ze] 
dt On = dqj | OA ia dqj | OA = 


a ag; a ag* 
=|—1@.¢.)0—~4+—1¢.¢.) 
E pt ED ges Oe BR |g 


since q* = q and q* = q when A = 0. On summing this result over j, we obtain 


d [Qo _ | aq; "3 aq7 
— = L ae reves se 
Fi S| 2 7 Da ee ) 7, q's) = 


j=l A=0 


| 
-———1 
XQ 
~ 
mh 
= 
~) 
- o 
L~) 

SS 
~~ 
~— 
—s | 

iL 


by the chain rule. Finally, we appeal to the invariance of L under the mappings {9)t*}. In this 
case, L(q*, g”, t) = L(q, q, t) and so 


d ded d . 
—L(q’,q’,t) = —L(q.q,t)=0 
med 4 ) mean) 
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It follows that 


and this proves the theorem. 


The importance of Noether’s theorem lies in the general notion that an invariance of 
the Lagrangian gives rise to a constant of the motion. Such invariance properties are of 
great importance when the Lagrangian formalism is extended to continuous systems and 
fields. For more details, see Goldstein [4] who will tell you more about Noether’s theorem 
than you wish to know! 


Problems on Chapter 12 


Answers and comments are at the end of the book. 


Harder problems carry a star (). 


Conservative systems 


12.1 A bicycle chain consists of N freely jointed links forming a closed loop. The chain can 
slide freely on a smooth horizontal table. How many degrees of freedom has the chain? How 
many conserved quantities are there in the motion? What is the maximum number of links the 
chain can have for its motion to be determined by conservation principles alone? 


12.2 Attwood’s machine A uniform circular pulley of mass 2m can rotate freely about its 
axis of symmetry which is fixed in a horizontal position. Two masses m, 3m are connected by 
a light inextensible string which passes over the pulley without slipping. The whole system 
undergoes planar motion with the masses moving vertically. Take the rotation angle of the 
pulley as generalised coordinate and obtain Lagrange’s equation for the motion. Deduce the 
upward acceleration of the mass m. 


12.3 Double Attwood machine A light pulley can rotate freely about its axis of symmetry 
which is fixed in a horizontal position. A light inextensible string passes over the pulley. At one 
end the string carries a mass 4m, while the other end supports a second light pulley. A second 
string passes over this pulley and carries masses m and 4m at its ends. The whole system 
undergoes planar motion with the masses moving vertically. Find Lagrange’s equations and 
deduce the acceleration of each of the masses. 


12.4 The swinging door A uniform rectangular door of width 2a can swing freely on its 
hinges. The door is misaligned and the line of the hinges makes an angle a with the upward 
vertical. Take the rotation angle of the door from its equilibrium position as generalised coor- 
dinate and obtain Lagrange’s equation for the motion. Deduce the period of small oscillations 
of the door about the equilibrium position. 
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12.5 A uniform solid cylinder C with mass m and radius a rolls on the rough outer 
surface of a fixed horizontal cylinder of radius b. In the motion, the axes of the two 
cylinders remain parallel to each other. Let 6 be the angle between the plane con- 
taining the cylinder axes and the upward vertical. Taking 6 as generalised coordinate, 
obtain Lagrange’s equation and verify that it is equivalent to the energy conservation 
equation. 

Initially the cylinder C is at rest on top of the fixed cylinder when it is given a very small 
disturbance. Find, as a function of 0, the normal component of the reaction force exerted on 
C. Deduce that C will leave the fixed cylinder when 9 = cos~!(4/7). Is the assumption that 
rolling persists up to this moment realistic? 


12.6 A uniform disk of mass M and radius a can roll along a rough horizontal rail. A par- 
ticle of mass m is suspended from the centre C of the disk by a light inextensible string of 
length b. The whole system moves in the vertical plane through the rail. Take as generalised 
coordinates x, the horizontal displacement of C, and 6, the angle between the string and the 
downward vertical. Obtain Lagrange’s equations. Show that x is a cyclic coordinate and find 
the corresponding conserved momentum p,. Is px the horizontal linear momentum of the 
system? 

Given that 9 remains small in the motion, find the period of small oscillations of the 
particle. 


12.7 A uniform ball of mass m rolls down a rough wedge of mass M and angle a, which 
itself can slide on a smooth horizontal table. The whole system undergoes planar motion. 
How many degrees of freedom has this system? Obtain Lagrange’s equations. For the special 
case in which M = 3m/2, find (i) the acceleration of the wedge, and (ii) the acceleration of 
the ball relative to the wedge. 


12.8 A rigid rod of length 2a has its lower end in contact with a smooth horizontal floor. 
Initially the rod is at an angle @ to the upward vertical when it is released from rest. The 
subsequent motion takes place in a vertical plane. Take as generalised coordinates x, the 
horizontal displacement of the centre of the rod, and 6, the angle between the rod and the 
upward vertical. Obtain Lagrange’s equations. Show that x remains constant in the motion 
and verify that the 0-equation is equivalent to the energy conservation equation. 

* Find, in terms of the angle 0, the reaction exerted on the rod by the floor. 


Moving constraints 


12.9 A particle P is connected to one end of a light inextensible string which passes through 
a small hole O in a smooth horizontal table and extends below the table in a vertical straight 
line. P slides on the upper surface of the table while the string is pulled downwards from 
below in a prescribed manner. (Suppose that the length of the horizontal part of the string is 
R(t) at time ¢.) Take 0, the angle between OP and some fixed reference line in the table, as 
generalised coordinate and obtain Lagrange’s equation. Show that @ is a cyclic coordinate and 
find (and identify) the corresponding conserved momentum pg. Why is the kinetic energy not 
conserved? 
If the constant value of pg is mL, find the tension in the string at time f. 
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12.10 A particle P of mass m can slide along a smooth rigid straight wire. The wire has 
one of its points fixed at the origin O, and is made to rotate in the (x, y)-plane with angular 
speed Q. Take r, the distance of P from O, as generalised coordinate and obtain Lagrange’s 
equation. 

Initially the particle is a distance a from O and is at rest relative to the wire. Find its 
position at time ¢. Find also the energy function / and show that it is conserved even though 
there is a time dependent constraint. 


12.11 Yo-yo with moving support A uniform circular cylinder (a yo-yo) has a light inex- 
tensible string wrapped around it so that it does not slip. The free end of the string is 
fastened to a support and the yo-yo moves in a vertical straight line with the straight part 
of the string also vertical. At the same time the support is made to move vertically hav- 
ing upward displacement Z(t) at time ¢. Take the rotation angle of the yo-yo as gener- 
alised coordinate and obtain Lagrange’s equation. Find the acceleration of the yo-yo. What 
upwards acceleration must the support have so that the centre of the yo-yo can remain 
at rest? 

Suppose the whole system starts from rest. Find an expression for the total energy E = 
T + V at time ¢. 


12.12 Pendulum with a shortening string A particle is suspended from a support by a light 
inextensible string which passes through a small fixed ring vertically below the support. The 
particle moves in a vertical plane with the string taut. At the same time the support is made to 
move vertically having an upward displacement Z(t) at time f. The effect is that the particle 
oscillates like a simple pendulum whose string length at time ¢ is a — Z(t), where a is a 
positive constant. Take the angle between the string and the downward vertical as generalised 
coordinate and obtain Lagrange’s equation. Find the energy function / and the total energy E 
and show that h = E — mZ?. Is either quantity conserved? 


12.13% Bug ona hoop A uniform circular hoop of mass M can slide freely on a smooth 
horizontal table, and a bug of mass m can run on the hoop. The system is at rest when the bug 
starts to run. What is the angle turned through by the hoop when the bug has completed one 
lap of the hoop? 


Velocity dependent potentials and Lagrangians 


12.14 Suppose a particle is subjected to a time dependent force of the form F = 
f(t) grad Wr). Show that this force can be represented by the time dependent potential 
U = —f (t)W(r). What is the value of U when F = f(t)i? 


12.15 Charged particle in an electrodynamic field Show that the velocity dependent poten- 
tial 


U =edg(r,t)—er- A(r,t) 


represents the Lorentz force F = e E +e vx B that acts on a charge e moving with velocity 
v in the general electrodynamic field {E (r,t), B(r, t)}. Here {@, A} are the electrodynamic 
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potentials that generate the field {E, B} by the formulae 


0A 
B= pe = B=curl A. 


Show that the potentials 6 = 0, A = tzi generate a field {F, B} that satisfies all four 
Maxwell equations in free space. A particle of mass m and charge e moves in this field. Find 
the Lagrangian of the particle in terms of Cartesian coordinates. Show that x and y are cyclic 
coordinates and find the conserved momenta px, Py. 


12.16% Relativistic Lagrangian The relativistic Lagrangian for a particle of rest mass mo 
moving along the x-axis under the simple harmonic potential field V = 5m 27x? is given by 


+2\ 1/2 
L = moc? (: (1 =) 5m Q?x?. 
c 


Obtain the energy integral for this system and show that the period of oscillations of amplitude 
a is given by 


4 rl 1+ 12 cos? 6 
T= / 2 dé, 
Q 0 1.2 2 1/2 
1+ qeé~ COS 0 


where the dimensionless parameter € = Qa/c. 
Deduce that 


20 
r= [1+ ne +0(“)], 
when € is small. 


Conservation principles and symmetry 


12.17 A particle of mass m moves under the gravitational attraction of a fixed mass M situated 
at the origin. Take polar coordinates r, 9 as generalised coordinates and obtain Lagrange’s 
equations. Show that 6 is a cyclic coordinate and find (and identify) the conserved momentum 


Pe- 


12.18 A particle P of mass m slides on the smooth inner surface of a circular cone of semi- 
angle a. The axis of symmetry of the cone is vertical with the vertex O pointing downwards. 
Take as generalised coordinates 7, the distance OP, and @¢, the azimuthal angle about the 
vertical through O. Obtain Lagrange’s equations. Show that @ is a cyclic coordinate and find 
(and identify) the conserved momentum pg. 


12.19 A particle of mass m and charge e moves in the magnetic field produced by a current 
I flowing in an infinite straight wire that lies along the z-axis. The vector potential A of the 
induced magnetic field is given by 


I 
Av=Ag=t, Ap == (S) Inr, 
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where r, 0, z are cylindrical polar coordinates. Find the Lagrangian of the particle. Show that 
@ and z are cyclic coordinates and find the corresponding conserved momenta. 


12.20 A particle moves freely in the gravitational field of a fixed mass distribution. Find 
the conservation principles that correspond to the symmetries of the following fixed mass 
distributions: (i) a uniform sphere, (ii) a uniform half plane, (iii) two particles, (iv) a uniform 
right circular cone, (v) an infinite uniform circular cylinder. 


12.21% Helical symmetry A particle moves in a conservative field whose potential energy 
V has helical symmetry. This means that V is invariant under the simultaneous operations (i) 
a rotation through any angle a about the axis Oz, and (ii) a translation ca in the z-direction. 
What conservation principle corresponds to this symmetry? 


Computer assisted problem 


12.22 Upside-down pendulum A particle P is attached to a support S by a light rigid rod of 
length a, which is freely pivoted at S. P moves in a vertical plane through S and at the same 
time the support S is made to oscillate vertically having upward displacement Z = €a cos pt 
at time ¢. Take 0, the angle between SP and the upward vertical, as generalised coordinate 
and show that Lagrange’s equation is 


g— (2 + Ep” cos pt) sind = 0, 


where Q? = g/a. The object is to show that, for suitable choices of the parameters, the 
pendulum is stable in the vertically upwards position! 

First write the equation in dimensionless form by introducing the dimensionless time tT = 
pt. Then 6(t) satisfies 


d6 QQ? n —— 
—=—=S 1 € cost |}sino = VU. 
dt2 p2 


Solve this equation numerically with initial conditions in which the pendulum starts from rest 
near the upward vertical. Plot the solution 6(t) as a function of t for about twenty oscillations 
of the support. Try « = 0.3 with increasing values of the parameter p/ in the range 1 < 
p/& < 10. You will know that the upside-down pendulum is stable when @ remains small in 
the subsequent motion. 


Even more surprisingly, it is possible to stabilise the double pendulum (or any multiple pendulum) in 
the upside-down position by vibrating the support. See Acheson [1] for photographs of a triple pendulum 
(and even a length of floppy wire) stabilised in the upside-down position by vibrating the support. However, 
the famous but elusive ‘Indian Rope Trick’, in which a small boy climbs up a self-supporting vertical rope, 


has yet to be demonstrated! 
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The calculus of variations 
and Hamilton’s principle 


KEY FEATURES 

The key features of this chapter are integral functionals and the functions that make them 
stationary, the Euler-Lagrange equation and extremals, and the importance of variational 
principles. 


The notion that physical processes are governed by minimum principles is older than 
most of science. It is based on the long held belief that nature arranges itself in the 
most ‘economical’ way. Actually, many ‘minimum’ principles have, on closer inspection, 
turned out to make their designated quantity stationary, but not necessarily a minimum. 
As a result, they are now known to be variational principles, but they are no less impor- 
tant because of this. A good example of a variational principle is Fermat’s principle of 
geometrical optics, which was proposed in 1657 as Fermat’s principle of least time in the 
form: 


Of all the possible paths that a light ray might take between two fixed points, the 
actual path is the one that minimises the travel time of the ray. 


Fermat showed that the laws of reflection and refraction could be derived from his 
principle, and proposed that the principle was true in general. Not only did Fermat’s 
principle ‘explain’ the known laws of optics, it was simple and elegant, and was capable 
of extending the laws of optics far beyond the results that led to its conception. This 
example explains why variational principles continue to be sought; it is because of their 
innate simplicity and elegance, and the generality of their application. 
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The variational principle on which it is possible to base the whole of classical mechan- 
ics was discovered by Hamilton* and is known as Hamilton’s principle." In its original 
form, it stated that: 


Of all the kinematically possible motions that take a mechanical system from one 
given configuration to another within a given time interval, the actual motion is the one 
that minimises the time integral of the Lagrangian of the system. 


Lagrange’s equations of motion can be derived from Hamilton’s principle, which can 
therefore be taken as the basic postulate of classical mechanics, instead of Newton’s laws. 
More importantly however, Hamilton’s principle has had a far reaching influence on many 
areas of physics, where apparently non-mechanical systems (fields, for example) can be 
described in the language of classical mechanics, and their behaviour characterised by 
‘Lagrangians’. Hamilton’s principle is generally regarded as one of the most elegant and 
far reaching principles in physics. 

In order to get concrete results from a variational principle, it is usually necessary to 
convert it to a differential equation. This can be done by using the calculus of variations, 
which is concerned with minimising or maximising the value of an integral functional. The 
calculus of variations is a large subject and we develop only those aspects most relevant to 
interesting physical problems, and to the understanding and use of variational principles. 


13.1 SOME TYPICAL MINIMISATION PROBLEMS 


The calculus of variations arose from attempts to solve minimimisation and max- 
imisation problems that occur naturally in physics and mathematics, but the scope of appli- 
cations has since widened greatly. We begin by describing three minimisation problems 
taken from geometry, physics and economics respectively. Maximisation problems also 
occur, but these can be converted into minimisation problems merely by reversing the sign 
of the quantity to be maximised. Thus we lose no generality by presenting the theory for 
minimisation problems only. 


1. Shortest paths — geodesics A basic problem of the calculus of variations is that of 
finding the path of shortest length that connects two given points A and B. If the path 
has no constraints to satisfy, such as having to go round obstacles or lie on a given curved 


* Sir William Rowan Hamilton (1805-1865), was a great genius but an unhappy man. He was appointed 
Professor of Astronomy at Trinity College Dublin at the age of twenty one, whilst still an undergraduate. 
Much of his early work is on optics where he introduced the notion of the characteristic function. His 
paper On a General Method in Dynamics, which contains what is now called Hamilton’s principle, was 
presented to the Royal Irish Academy in 1834. He was knighted in 1835. However, his personal life was 
as chaotic as his academic achievments were brilliant. He was frustrated in love, frequently depressed 
and a heavy drinker; this culminated in his making an exhibition of himself at a meeting of the Irish 
Geological Society. He spent the later years of his life working on the theory of quaternions, but they 
were never the great discovery he had hoped for. 

Hamilton’s principle is sometimes called the principle of least action. The terminology in this area is 
confusing, since another variational principle of mechanics, Maupertuis’s principle, is also referred to as 
the principle of least action. 
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FIGURE 13.1 A soap film is stretched between two circular wires. It has the form of a surface of 
revolution generated by rotating the curve y = y(x) about the x-axis. 


surface, the answer is well known; it is the straight line joining A and B. However, it is 
still instructive to formulate this problem and to check that the calculus of variations does 
yield the expected result. 

Suppose that A = (0,0), B = (1,0) and that the general path in the (x, y)-plane 
connecting these points is y = y(x). Then the total length of the path is 


1 27 1/2 
Lly]= 1 dy ' 
y] ee dx. (13.1) 
0 x 


The problem is to find the function y(x), satisfying the end conditions y(0O) = 0, y(1) = 
0, that minimises the length L. This is the subject of Problem 13.3; the answer is indeed 
the straight line y = 0. 

In general, paths of shortest length are called geodesics. For example, geodesics on 
the surface of a sphere are great circles. Some surfaces (such as the cylinder and cone) are 
developable, which means that they can be rolled out flat without changing any lengths. 
The geodesic can be drawn while the surface is flat (it is now a straight line) and the 
surface can then be rolled back up again. In general however, surfaces are not developable 
and geodesics have to be found by the calculus of variations. 


2. The soap film problem Two rigid circular wires each of radius b have the same axis 
of symmetry and are fixed at a distance 2a from each other. A soap film is created which 
spans the two wires as shown in Figure 13.1. The soap film has the form of a surface of 
revolution with the two circular ends open. What is the shape of the soap film? 

Fortunately, we can formulate this problem without resorting to the theory of thin 
membranes! Since the air pressure is the same on either side of the film, and since the 
effect of gravity is negligible, surface tension is the dominant effect. The condition that 
the total energy be a minimum in equilibrium is therefore equivalent to the condition that 
the area of the film be a minimum. 

Let the film be the surface generated by rotating the curve y = y(x) about the x-axis. 
Then the surface area A of the film is 


Aly] = an f y {1 +37 ax, (13.2) 


=a4 
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where y means dy/dx. The problem is to find the function y(x), satisfying the end 
conditions y(—a) = b, y(a) = b, that minimises the area A. This is the subject of 
Problem 13.7. 


3. A mimimum cost strategy A manufacturer must produce a volume X of a product 
in time T. Let x = x(t) be the volume produced after time t and suppose that there is a 
production cost a + 6x per unit volume of product and a storage cost yx per unit time, 
where a@, 6 and y are positive constants. The term 6x is a simple model of the increased 
costs associated with faster production. Then the total cost C of the production run is 


xX T 
cix1= | Cea ax+ f yxdt, 
0 0 


which can be written in the form 


T 
C[x]= / {(a + Bx)x + yx} dt. (13.3) 
0 


The problem is to find the function x(t), satisfying the end conditions x(0) = 0, x(T) = 
X , that minimises the cost C. This is the subject of Problem 13.6; it is found that producing 
the goods at a uniform rate is not the best strategy. 


Integral functionals 


The expressions L[ y], A[ y] and C[x ] are examples of integral functionals. Functionals 
differ from ordinary functions in that the independent variable is a function, not a num- 
ber; however, the dependent variable is a number, as usual. The calculus of variations is 
concerned with minimising or maximising integral functionals. 


13.2 THE EULER-LAGRANGE EQUATION 


Before we begin the theory proper, it is useful to recall the procedure for finding 
the value of x that minimises an ordinary function f (x) on the interval a < x < b. The 
procedure is as follows: 


1. First find the values of x (in the range a < x < 5) that satisfy the equation f’(x) = 
0. These are the stationary points of f(x). They are so called because, if x* is a 
stationary point, then 


fart +h) — fa) =0 (1?) (13.4) 


for all sufficiently small h. (That is, at a stationary point, the change in f due to a small 
change h in x is of order h?.) 

2. Now determine the nature of each stationary point, that is, whether it is a minimum 
point, a maximum point, or neither. This can usually be done by examining the sign of 
f”(x*). The minimum points are the local minima of {. They are so called because 


fQ@* +h) > f(x*) (13.5) 
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FIGURE 13.2 The minimising function x* is perturbed by the 
admissible variation h. 


for sufficiently small h, but not necessarily for all h. (In other words, the inequality 
f (x) = f (x*) is true when x is close enough to x*.) 

3. Determine the values of f at the extreme points x = a,x = b. 

4. The global minimum of / is then the least of the local minima of f and the extreme 
values of f. 


Each of these steps has its counterpart in the calculus of variations. However, since 
this material is large enough to fill a book by itself, we will mainly be concerned with the 
first step. This will still be enough to narrow down the search for the minimising function 
x*(t) to a finite number of possibilies and often one ends up with only one possibility. 
Thus, if it is ‘known’ (rigorously or otherwise!) that a minimising function does exist, 
then the problem is solved. 

The general problem in the calculus of variations is that of finding a function x*(t) 
that minimises an integral functional of the form 


b 
J[x]= / F (x, x,t) dt, (13.6) 


where F is a given function of three independent variables.* Suppose that the function 
x*(t) minimises the functional /[x ]. This means that 


J(#] =] J[2"] (13.7) 


for all admissible functions x(t). Here admissible means that x must satisfy whatever 
end conditions are prescribed at t = a and t = b. We will always assume that these 


* This means that, despite the fact that x, x and ¢ are clearly not independent of each other (x is a function 
of t and x is the derivative of x), the partial derivatives of F are evaluated as if x, x and t were three 
independent variables. For example, in the case of the cost functional C given by equation (13.3), F = 
(a + Bx)x + yx and the partial derivatives of F are 

OF OF OF 
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conditions have the form x(a) = A and x(b) = B, where A, B are given. It is convenient 
to regard the function x(t) that appears in (13.7) as being composed of x*(t) together with 
a variation® /(t) so that we may alternatively write 


J[x* +h] > J[x*] (13.8) 


for all admissible variations (t). Since x must satisfiy the same end conditions as x*, the 
admissible variations are those for which h(a) = h(b) = 0 (see Figure 13.2). 

Most readers will find the theory that follows quite difficult. To aid understanding, the 
argument is broken down into three separate steps. 


The variation in / and the meaning of ‘stationary’ 


The first step is to give a meaning to the statement that a function x (t) makes the functional 
J [x ] stationary. 

Let x*(t) be any admissible function and A(t) an admissible variation. When h is a 
small variation, we can estimate the corresponding variation in J[ x ] by ordinary calculus, 
as follows: 

Let ¢ have any fixed value. Then x and x are just real numbers and the variation in F 


due to the variation h in x is* 


OF OF 
FQ* +h," +h, t)— FQ" 2) =h_@" +h GP N40 (0? os i?) 
XxX XxX 


when h and h are both small. On integrating both sides of this equation with respect to ¢ 
over the interval [a, b], the corresponding variation in J is given by 


by; OF .OF 
Jxt+n}—Jie"} = f [Eat itn FASE Ot] dt +0 (ee (13.9) 


for small ||/||, where ||/|| is defined by 
[||| = max |h()| + max |A()| 
a<t<b a<t<b 
and is called the norm‘ of h. (When ||/|| is small, both |/(¢)| and |h (t)| are small through- 


out the interval [a, b].) The second term in the integrand of equation (13.9) can be inte- 
grated by parts to give 


b t=b b 
.OF oF d (0F 
/ h—(x,x*,t)dt = | h—(x*, x*,1t) -{ h — | —(x*,x*,t) } dt 
a Ox ox qa at \ Ox 


t=a 


* These are the variations that give the ‘calculus of variations’ its name. 
+ The formula we are using is that, if G(u, v) is a function of the independent variables u and v, then the 
variation in G caused by the variations ug > ug + h and vg > vp +k is given by 


aG le 
Glug +h, v9 +k) ~ Guo, v9) = h— (uo, v9) + k= (up, U9) + O (1? +4 i?) 
i 


for small h, k. Note that the partial derivatives are evaluated at the ‘starting point’ (wo, vg). 

+ Do not be too concerned over the exact definition of ||/||. It must be written in some such way for 
mathematical correctness, but the only property that we will need is that ||/|| is proportional to h in the 
sense that, if / is multiplied by a constant A, then so is ||/||. 
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and, since h is an admissible variation satisfying h(a) = h(b) = 0, the integrated term 
evaluates to zero. We thus obtain 


b 

OF d (oF 
J[x* +h] —J[x*]= / — (x*, x*,t) — — | —(*, x*,1)) | hdt+o (IA11*) : 

a L Ox dt \ ox 
(13.10) 
This is the variation in J caused by the admissible variation h in x when \|\h|| is small. 
The variation in J is therefore linear in to A with an error term of order ||/||?. By analogy 
with the case of ordinary functions, we say that x* makes J[x] stationary if the linear 

term is zero, leaving only the error term. 


Definition 13.1 Stationary J The function x*(t) is said to make the functional J{x | 
stationary if 


JL +h] = JLx*] = 0 (II?) (13.11) 


when ||h|| is small. 


It follows that the condition that x* makes J[x ] stationary is equivalent to the condi- 


tion that 
b 
OF Kk d (oF Kk 
—(x",x",t) -— | —-@",x",t hdt =0 13.12 
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for all admissible variations h. 


Minimising functions make J stationary 


Suppose now that x*(t) provides a local minimum for J [x ] in the sense that 
J[x* +h] > J[x*] (13.13) 


when ||/|| is small. We will now show that such an x* makes J [x ] stationary. 
If we substitute equation (13.10) into the inequality (13.13), we obtain 


b 
i Eeocue ee (Feor.i%.0) hdt +O (iln11?) >0 (13.14) 
a | Ox Ox 


for small ||h||. It follows from this inequality that the integral term must be zero. The 
proof is as follows: 


In the inequality (13.14), let h be replaced by Ah, where A is a positive constant. On dividing through 


by A, this gives 
oT OF 1 (dF 
/ "ot, ) - —(~o*, 29) | nat+10 (i1m1/?) > 0, 
a ox dt \ ox 


On letting A — 0, we find that 


bY aF 1 (OF 
/ “* (x*,*, 1) -= (= *, *,1)) [nat = 0 
a 1 0% dt \ ox 
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bump function h 


FIGURE 13.3 An interval (c, d) in which G(t) > O anda 
corresponding bump function h(t). The integral of G x h 
must be positive. 


for all admissible variations h. In particular, this inequality must remain true if h is replaced by —h, but 
this is only possible if the integral has the value zero. 


Equation (13.10) therefore reduces to 
JIx* +h] —J[x*] = 0 (i117) (13.15) 


for small ||h||. By definition, this means that x* makes J[x] stationary. The same result 
applies to functions that provide a local maximum for J[x ]. Our result is summarised as 
follows: 


Functions that minimise or maximise / make J stationary 


If the function x* provides a local mimimum for the integral functional J[x ], then 
x* makes J stationary. The same applies to functions that provide a local maximum 
for J. 


Euler-Lagrange equation 


We will now obtain the differential equation that must be satified by a function x*(t) that 
makes J [x ] stationary. This is the counterpart of the elementary condition f’(x) = 0. To 
find this, we return to equation (13.12). In the integrand, the function inside the square 
brackets looks complicated but it is just a function of ¢ and, if we denote it by G(¢), then 


b 
/ G(t) h(t) dt =0 (13.16) 


for all admissible variations h. In fact, the only function G(t) for which this is possible is 
the zero function, that is, G(t) = 0 fora < t < b. The proof is as follows: 


Suppose that G(r) is not the zero function. Then there must exist some interval (c, d), lying inside 
the interval (a, b) in which G(t) ¥ 0 and thus has constant sign (positive, say). Take h(t) to be a ‘bump 
function’ (as shown in Figure 13.3), which is zero outside the interval (c, d) and positive inside. For such a 
choice of h, 


b d 
/ G(t) h(t) dt = G(t) h(t) dt > 0, 


Cc 
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since the integral of a positive function must be positive. This contradicts equation (13.16) and so G must 
be the zero function. 
We have therefore shown that 


oF... a (OF y 
Ae De i NO ogi b “an, SNK Xe git =0 
sen a (Fs eel 


fora < t < b, which is the same as saying that x* must satisfy the Euler-Lagrange 
differential equation 


The above argument is reversible and so the converse result is also true. Our result is 
summarised as follows: 


Euler-Lagrange equation 


If the function x* makes the integral functional 


b 
J[x] =|) F (x, ,t) dt, (13.17) 


stationary, then x* must satisfy the Euler-Lagrange differential equation 


d (0F OF 
- —(l), (13.18) 
ahi \ Ost 


The converse result is also true. 


It is very convenient to give solutions of the Euler-Lagrange equation a special name. 


Definition 13.2 Extremals Any solution of the Euler-Lagrange equation is called an 
extremal” of the functional J [x ]. 


Key results of the calculus of variations 
e If the function x* minimises or maximises the functional J, then x* makes J 
stationary and so x* must be an extremal of J. 


e If x* is an extremal of J, then x* makes J stationary, but it may not minimise or 
maximise J. 


The term extremal should not be confused with extremum (plural: extrema). Extremum means maximum 
or minimum. Thus a function that provides an extremum of J must make J stationary and so must be 
an extremal of J (that is, it must satisfy the Euler-Lagrange equation). The converse is not true. An 
extremal may not provide a mimimum or maximum of J. (Try reading this again!) 
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Fortunately, solving problems with the E—L equation is much easier than the preceding 
theory! The E—L equation is a second order non-linear ODE, and so one needs a measure 
of luck when it comes to finding solutions. Nevertheless, many interesting cases can be 
solved in closed form. 


Example 13.1. Finding extremals 1 


Find the extremal of the functional 


2. 22 
Jixj= | at 
1 At 


that satisfies the end conditions x(1) = 5 and x(2) = 11. 


Solution 


By definition, extremals are solutions of the E-L equation. In the present case, F = 
x?/4t so that 


OF _ OF i 


ox” Ox 2t’ 


and the E—L equation takes the form 


On integrating, we obtain 
x=ct?+ d, 


where c and d are constants of integration. The extremals of J are therefore a family 
of parabolas in the (tf, x)-plane. The admissible extremals are those that satisify the 
prescribed end conditions x(1) = 5 and x(2) = 11. On applying these conditions, we 
find that c = 2 and d = 3 so that the only admissible extremal of J[.x ] is given by 


¥=2°+3.m 


Question Maximum, minimum, or neither? 


Does the extremal = 2t? + 3 maximise or minimise J? 


Answer 


The admissible extremal X is known to make J stationary. It may minimise J, max- 
imise J, or do neither. With the theory that we have at our disposal, we cannot 
generally decide what happens. However, in a few simple cases (including this one), 
we can decide very easily. 
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Let h be any admissible variation (not necessarily small) and consider the varia- 
tion in J that it produces, namely, 


2 BY 2 2 
IIE +h] nis f ate a [2 d 
1 


2 h2 2 
: a 7) at [aca 
1 4t 1 
2 
h 
Te fea 
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since hf is an admissible extremal satisifying (1) = h(2) = 0. Hence 


2 j2 
IF +A] n= f dt > 0, 
1 4t 


since the integral of a positive function must be positive. Thus X actually provides the 
global minimum of J[x ]. The global minimum value of J is therefore J [2r7+3]= 
6. H 


A useful integral of the Euler-Lagrange equation 


Not all examples are as easy as the last one and the E—L equation often has a complicated 
form. However, for the case in which the function F (x, x, t) has no explicit dependence 
on f (that is, F = F(x, x)), the second order E—L equation can always be integrated once 
to yield a first order ODE. This offers a great simplification in many important problems. 

Suppose that F = F(x, x). Then it follows from the product rule and the chain rule 


that 
ad {,0F .OF  .d (dF OF OF 
24S F | =4s55 5% : +x 
= ( Ox ) Ox = (=) (x Ox 4 


(da (aF\ aF ae 
-i(($)-F): ari 


Thus, if x satisfies the E-L equation 
d (0F OF 0 
t \ ax a 


it follows that x satisfies the first order equation 


OF 
arr — F =constant. (13.20) 
Xx 


for some choice of the constant c. Conversely, if x is any non-constant solution of equation 
(13.20) then it satisfies the E-L equation. 
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It should be noted that equation (13.20) always has solutions of the form x = constant, but these 
solutions usually do not satisfy the corresponding E—L equation. They occur because of the factor x that 
appears on the right in equation (13.19). When overlooked, this glitch can give baffling results. Do not 
believe that constant solutions of equation (13.20) satisfy the E-L equation unless you have checked it 


directly! 


Our result is summarised as follows: 


A first integral of the E—L equation 
Suppose that F = F(x, x). Then any function that satisfies the E-L equation 


d (aF Lage 
dt \ ax ax 


also satisfies the first order differential equation 


i> = FF =e, (13.21) 


for some value of the constant c. Conversely, any non-constant solution of equation 
(13.21) satisfies the E-L equation. Constant solutions of equation (13.21) may or 
may not satisfy the E—L equation. 


Example 13.2 Finding extremals 2 


Find the extremal of the functional 
7 21/2 
1+; 
sixi= f Ss 
0 x 


that lies in x > O and satisfies the end conditions x(0) = 4 and x(7) = 3. [The 
restriction x > O ensures that the integrand does not become singular. ] 


Solution 


By definition, extremals are solutions of the E—L equation and, since ¢ is not explicitly 
present in this functional, we can use the integrated form (13.21). On substituting 
F = (14 x7)!/2/x into (13.21) and simplifying, we obtain 


x(1+%7)!? =c, 


where C is a constant; since x is assumed positive, C must be positive. This equation 
can be rearranged in the form 


C2 = 42/2 
pee cient ey 
Xx 


a pair of first order separable ODEs. 


378 


Chapter 13 The calculus of variations and Hamilton’s principle 


The solutions are* 
1/2 
+(c?- 17) = ped, 


where D is a constant of integration. Hence the extremals of J are (the upper halves 
of) the family of circles 


++ DS =C? 


in the (¢, x)-plane. On applying the given end conditions, we find that C = 5 and 
D = —3, so that the only admissible extremal is an arc of the circle with centre 
(3, 0) and radius 5, namely 


f= 40/164 6-2 (O<t<7). 


Since there is only one admissible extremal, it follows that, if it were known that 
a minimising (or maximising) function existed, then this must be it. However, we 
have no such knowledge and no means of deciding whether X provides a minimum or 
maximum for J, or neither. (It actually provides the global minimum of J.) & 


Our final example is the famous brachistochrone problem.‘ 


Example 13.3 The brachistochrone (shortest time) problem 


Two fixed points P and Q are connected by a smooth wire lying in the vertical plane 
that contains P and Q. A particle is released from rest at P and slides, under uniform 
gravity, along the wire to Q@. What shape should the wire be so that the transfer is 
completed in the shortest time? 


Solution 


Suppose that the wire lies in (x, z)-plane, with Oz pointing vertically downwards, 
with P at the origin, and Q at the point (a, b). Let the shape of the wire be given by 
the curve z = z(x). Then, since the particle is released from rest when z = 0, energy 
conservation implies that the speed of the particle when its downward displacement 
is z is (2g¢z)!/*. The time T taken for the particle to complete the transfer is therefore 


1/2 


a sey 
(hee Ht: (13.22) 


zi 


{z= ag)? f 
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* It is evident that these equations also admit the constant solution x = C. However, it may be verified that 
the E-L equation for this problem has no constant solutions. 

* This famous minimisation problem was posed in 1696 by Johann Bernoulli (who had already found 
the solution) as a not-so-friendly challenge to his mathematical contemporaries. Solutions were found 
by Jacob Bernoulli, de 1’H6pital, Leibnitz, and Newton, who (according to his publicity manager) had 
the answer within a day. Newton published his solution anonymously, but Johann Bernoulli identified 
Newton as the author declaring ‘one can recognise the lion by the marks of his claw’. The last word goes 
to Newton. He complained ‘J do not love to be pestered and teased by foreigners about mathematical 
things ...’. 
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where z means dz/dx. The problem is to find the function z(x), satisfying the end 
conditions z(0) = 0, z(a) = b, that minimises T. 

If x* minimises T, then it must make T stationary and so be an extremal of 
T. Since x is not explicitly present in this functional, we can use the integrated form 
(13.21) of the E-L equation. On substituting in F = (1+27)!/?/z!/? and simplifying, 
we obtain 


z(1+2) =2¢, 


where C is a positive constant. (The constant is called 2C for later convenience.) This 
equation can be arranged in the form 


IC an 
r= ( ) 
Zz 


a pair of first order separable ODEs. (The constant solution z = 2C is not an extremal 
of J and can be disregarded.) Integration gives 


. 1/2 
x=t - . 
x Ic) dz 


To perform the integral, we make the substitution* z = C(1 — cos w), in which case 


ae 1—cosw 2 : 
x=ac f (=) siny dw 


= 4c [ 2sin? by ay 
=+tC(y -—sinw)+D, 


where D is a constant of integration. Thus the extremals of J have the parametric 
form 


x =+C(w—-sinw)+D, z=C(1—-cosyw), 


with yw as parameter. Since the two choices of sign correspond only to a change of 
sign of the parameter, we may assume the positive choice. These curves are a family 
of cycloids’ with ‘radius’ C and shift D in the x-direction. 

Now we find the admissible extremals. The condition that z = 0 when x = 0 
implies that the shift constant D = 0. The radius C of the cycloid is then determined 
from the second end condition z = b when x = a. In general, C must be determined 
numerically but, in special cases, C may be found analytically. For example, if Q 
is the point (a, 0) (so that P and Q are on the same horizontal level) it is found that 
C =a/2z. A more typical case is shown in Figure 13.4. 


* It’s easy to spot the smart substitution when you already know the answer! 

* The cycloid is the path traced out by a point on the rim of a disk rolling on a plane; the ‘radius’ referred 
to is the radius of this disk. Since the E-L equation cannot be satisfied at a cusp, each extremal must lie 
on a single loop of the cycloid. 
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FIGURE 13.4 The curve that minimises T[z] is an arc of a cycloid. 


Since there is only one admissible extremal, it must be the minimising curve for 
T[z], provided that a minimising curve exists at all. In view of the physical origin of 
the problem, most of us are happy to take this for granted, but purists may sleep more 
soundly in the knowledge that this can also be proved mathematically. ml 


13.3. VARIATIONAL PRINCIPLES 


The laws of physics are usually formulated in terms of variables or fields that 
satisfy differential equations. Thus, the generalised coordinates q(t) of a mechanical sys- 
tem satisfy Lagrange’s equations (a system of ODEs), the electromagnetic field satisfies 
Maxwell’s equations (a system of PDEs), the wave function of quantum mechanics satis- 
fies Schrodinger’s wave equation, and so on. But there is an alternative way of expressing 
these laws in terms of variational principles. /n the variational approach, the actual 
physical behaviour of the system is distinguished by the fact that it makes a certain inte- 
gral functional stationary. Thus all of the physics is somehow contained in the integrand 
of this functional! 

Expressing physical laws in variational form does not make it any easier to solve 
problems. Indeed, problems will continue to be solved by using differential equations. 
The virtue of the variational formulation is that it is much easier to extend existing theory 
to new situations. For example, the theory of fields can be developed in the language of 
classical mechanics by using the variational formulation. 


Fermat’s principle 


These ideas are nicely illustrated by an example most readers will be familiar with: 
the paths of light rays in geometrical optics. When travelling through a homogeneous 
medium, light rays travel in straight lines. But, on meeting a plane interface between two 
different homogeneous media, the ray is either reflected or suffers a sharp change of direc- 
tion called refraction, as shown in Figure 13.5. This change of direction is governed by 
Snell’s law of refraction 1, sin@; = n2 sin @2, where n, and nz are the refractive indices 
of the two media, and 6; and 62 are the angles that the ray makes with the normal to the 
interface. In terms of the angle w used in Figure 13.5, Snell’s law takes the form 


n, Cos Wy = 7n2 COS W2 = 13.COS W3. 
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FIGURE 13.5 When a light ray passes between homogeneous media (left), it satisfies Snell’s law 
ny, cos Wi = nN2 cos W2 = n3 cos W3. In the continuous case with n = n(y) (right), Snell’s law 
becomes n cos yy = constant. 


x 


In the more general case of an inhomogeneous medium in which 7 varies continuously 
in the y-direction, one would then expect curved rays that satisfy Snell’s law in the form 


ncos yf = constant. 


A variational principle consistent with these rules was proposed by Fermat in 1657 
and became known as Fermat’s principle of least time. This stated that: 


Of all the possible paths that a light ray might take between two fixed points, the 
actual path is the one that minimises the travel time of the ray. 


Fermat showed that his principle implied the truth of the laws of reflection and refrac- 
tion (as well as predicting straight rays in a homogeneous medium). Fermat’s original 
principle is a beautifully simple and general statement about the paths taken by light rays 
but, sadly, it is not quite correct. The correct version is as follows: 


Fermat’s principle 


The actual path taken by a light ray between two fixed points makes the travel time 
of the ray stationary. 


The difference between the original and correct versions is that the path taken by the 
ray does not neccessarily make the travel time a minimum, but it does make the travel time 
stationary.* In practice, the travel time is usually a minimum, but there are exceptional 
cases where it is not. 

If the free-space speed of light is c, then the speed of light at a point of a medium 
where the refractive index is n is c/n. A (hypothetical) path P in the medium would 


* Surprisingly, incorrect statements about Fermat’s principle abound in the literature. It is often claimed 
that ‘the path of a light ray makes the travel time a minimum or (occasionally) a maximum’. This is 
untrue. The path of a ray can never make T a maximum. It usually makes T a minimum, occasionally it 
provides neither a minimum nor a maximum, but it never provides a maximum. 
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therefore be traversed in time T given by the line integral 
T[P] =o} nds. (13.23) 
P 


Since paths that make T stationary are extremals of T we can restate Fermat’s pinciple in 
the elegant form: 


Fermat’s principle — Classy version 


The paths of light rays in a medium are the same as the extremals of the functional 
T for that medium. 


Suppose that the refractive index n in the medium depends only on y (as in Figure 
13.5) and consider rays that lie in the (x, y)-plane. Then a ray that connects the points 
(xo, yo) and (x1, y,) must be an extremal of the functional T which, in Cartesian coordi- 
nates, takes the form 


x 1/2 
Tly]= ay n (1 +3) dx, (13.24) 
xo 
where y means dy/dx, and n = n(y). Since n does not depend upon x, we may use the 
integrated form (13.21) of the E—-L equation, which gives 
n 


If we write y = tan y, where w is the angle between the tangent to the ray and the x-axis 


= constant. 


(see Figure 13.5), then this equation becomes 
ncos w = constant, 


exactly as anticipated from Snell’s law for layered media. 


Question A puzzle 

When n = n(y), it is easy to verify that the straight lines y = constant are not 
extremals of T[y] and are therefore not rays (although they do satisfy Snell’s law!). 
But since such a ‘ray’ would experience a constant value of n, how does the ray know 
that it must bend? 


Answer 
Your physics lecturer will be pleased to answer this question. 


The variational approach really comes into its own however when we extend our the- 
ory to other inhomogeneous media, where the correct generalisation of Snell’s law is dif- 
ficult to spot. There is no such difficulty with the variational approach; Fermat’s principle 
still holds. The case of a light ray propagating in an axially symmetric medium is solved 
in Problem 13.9. From Fermat’s principle, this is quite straightforward, but, starting from 
Snell’s law, one would probably guess the wrong formula! 
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FIGURE 13.6 Hamilton’s principle Of all 
the kinematically possible trajectories of a 
system that connect the configurations 
q = qi and g = q2 in the time interval 
[t1, t2], the actual motion g*(t) makes the 
action functional of the system stationary. 


13.4 HAMILTON’S PRINCIPLE 


Hamilton’s principle is the variational principle that is equivalent to Lagrange’s 
equations of motion. The comparison with geometrical optics is that Hamilton’s principle 
corresponds to Lagrange’s equations as Fermat’s principle corresponds to Snell’s law. We 
consider first the special case of systems with one degree of freedom. 


Systems with one degree of freedom 


Consider a Lagrangian system with a single generalised coordinate g and Lagrangian 
L(q,q,t). Then the trajectory g*(t) is an actual motion of the system if, and only if, it 
satisfies the Lagrange equation 


d (aL aL _ 4 (13.25) 
dt \ aq aq. 


It is impossible not to notice that equation (13.25) is the Euler-Lagrange equation one 
would get by making stationary the functional S[g ] defined by 


t 
S[q] = / | L(q,q,t)dt. (13.26) 
to 


The scalar quantity S is called the action and the functional S[q | is called the action 
functional corresponding to the Lagrangian L (for the time interval [fo, f1]). 

From this simple observation, it follows that g*(t) is an actual motion of the system 
if, and only if, it makes the action functional S{q | stationary. The situation is as shown 
in Figure 13.6. This is Hamilton’s principle for a mechanical system with one degree of 
freedom. 


Example 13.4 Hamilton’s principle 


A certain oscillator with generalised coordinate g has Lagrangian 


Verify that g* = sint is a motion of the oscillator, and show directly that it makes the 
action functional $[q ] satationary in any time interval [0, T]. 
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Solution 
Lagrange’s equation corresponding to the Lagrangian L = 57° — sq" is 
Gtq=0. 


Since q* satisfies this equation, it is a motion of the oscillator. 
Let h(t) be an admissible variation. Then 


te 
Slq* +h] —-Slq*] = if [ (cose +h)? — (sint + hy? — cos? t + sin? | dt 
0 


T 
= if (2h cost +h? — 2h sint — h) de 
0 


T 
= [i cost | + if (i? = i?) dt 
0 0 
ic; 
= | (i? — 1?) at, 
0 


since h(O) = h(t) = 0. It follows that 
|Stq* +h] —Slq*]| < 51 (nsx |A(t)|> + max Wor) 
O<t<t O<t<t 


2 
<5t (nsx |h(t)| + max Wo) 
O<t<t O<t<t 
= 5t|lAll. 
Hence 
Stq* +h] — Stq*] = 0 (\IAlP), 


which, by definition, means that g* makes the action functional S[q ] stationary. [It 
may not make S[q ] a minimum.] 


Systems with many degrees of freedom 


Hamilton’s principle can be extended to systems with any number of degrees of freedom. 
In this more general case, the system has generalised coordinates g = (41, 92,---.9n). 
the Lagrangian has the form L = L(q, q, t), and Lagrange’s equations of motion are the 
n simultaneous equations 


d ( oL OL 
) =0 (l<j<n). (13.27) 
dt 0qj 0qj 


The action functional is now defined to be: 
Definition 13.3 Action functional The functional 
ty 
stai= | L(q,q,t)dt (13.28) 
t 


is called the action functional corresponding to the Lagrangian L(q,q,t) (for the time 
interval [to, t,]). 
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The notation S[q] is really a shorthand form for S[q1,q2,...,9¢n] so that now 
there are n functions that can be varied by the ” independent variations hj, h2,...,hn 
respectively. In the vector notation, such a variation is denoted by h, where h = 
(hy, h2,..., hy). The theory that we have developed does not cover the case where the 
functional has more than one ‘independent variable’, but it can be extended to do so. An 
outline of this extension is as follows: 


Consider the general situation in which 


b 
sixi= [ F(x, x,t) dt, 


where the vector function x(t) = (x1 (t), x2(f),...,Xn(¢t)). By using the same argument as before, the 
variation in J caused by the admissible* variation h in x* is found to be 


d (aF 
iie* pal = oe 2 Exe 9-2 (Fer i* 1) ]njdr+0 (iimP), 
J a | 


where |[h||2 = Ay ||2 + ||Ao||? fteeet An |2. This variation is linear in A with an error term of order 
||h||2. As before we say that x* makes J[x] stationary if the linear term is zero, leaving only the error 
term. 


Definition 13.4 Stationary J The vector function x*(t) is said to make the functional 
J [x] stationary if 


Jl x* +h) — J[x*] = 0 (IiziI?) 


when ||h|| is small. 


If x* makes the functional J[ x] stationary then 


ae i ae d (aF 
: —(x*, e*, 1) - — ( —(a*, 2*, 0) | [hj dt =0, 
‘aya Ox j dt \ 0x; ’ 


for all admissible variations h. By allowing each of the {x ;} to vary separately (while the others remain 
constant), the ‘bump function’ argument can be applied exactly as before to show that 


oF d | ae ere aa : 
= (x*, #*,) - < (#9) =0 (l<j<n). 
Ox j dt Xj 


This is the same as saying that x* must satisfy the simultaneous Euler-Lagrange equations 


ard (aF)_y dees 
_~— — | — J = Nn). 
dx; dt \ax; wr 


Our result is summarised as follows: 


* The vector variation h is admissible if h(a) = h(b) = 0, that is, if hy, ho, ..., hy are all admissible. 
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Euler-Lagrange equations with many variables 


The vector function x* makes the integral functional 


b 
iiea)= |] Fee Aidt 


stationary if, and only if, x* satisfies the simultaneous Euler-Lagrange differential 
equations 


d (0F oF 
—0 Il Ff cs , 
a(S) (lsjsn) 


This is a natural generalisation of the single variable theory and corresponds to the 
elementary result that a function of n variables f (x1, x2,..., Xn) has a stationary point if, 
and only if, all its first partial derivatives vanish at that point. 

The statement of Hamilton’s principle for systems with many degrees of freedom is 
therefore: 


Hamilton’s principle 


The trajectory q*(t) is an actual motion of a mechanical system if, and only if, q* 
makes the action functional of the system stationary. 


The only essential difference between this correct version of Hamilton’s principle and 
the original version (quoted at the beginning of the chapter) is that an actual motion of the 
system does not neccessarily make the action functional a minimum, but it always makes 
the action functional stationary.* In practice, the action functional is usually minimised, 
but there are exceptional cases where it is not (see Problem 13.11). 

As with Fermat’s principle, there is a classy version of Hamilton’s principle, which is 
less wordy and more satisfactory generally. It makes use of the concept of the extremals 
of J, which are simply solutions of the n simultaneous Euler-Lagrange equations. 


Hamilton’s principle — Classy version 


The actual motions of a mechanical system are the same as the extremals of its action 
functional. 


* 


Incorrect statements about Hamilton’s principle also abound in the literature. It is often claimed that ‘an 
actual motion the system makes the action functional a minimum or (occasionally) a maximum’. This 
is untrue. It is not possible to make S a maximum. The actual motion of the system usually makes S a 
minimum, occasionally it provides neither a minimum nor a maximum, but it never provides a maximum. 


13.4 Hamilton’s principle 387 


Significance of Hamilton’s principle 


Since Hamilton’s principle is equivalent to Lagrange’s equations, it can be regarded as the 
fundamental postulate of classical mechanics, instead of Newton’s laws,* for any mechan- 
ical system that has a Lagrangian. It should be emphasised that this is not a new theory 
— the Newtonian theory is correct — but an alternative route to the same results. Thus 
we can derive Lagrange’s equations of motion from the Newtonian theory (as we did) or, 
more directly, from Hamilton’s principle. Because Hamilton’s principle can be extended 
to apply to a wide range of physical phenomena while the Newtonian theory can not, 
Hamilton’s principle is regarded as the more fundamental. 

The problem with taking Hamilton’s principle as the fundamental postulate of clas- 
sical mechanics is that, had one not been exposed to the traditional treatment, one would 
have no idea what the Lagrangian ought to be for any particular system. To convince 
oneself of the difficulties involved, it is instructive to read Landau’s [6] ‘derivation’ of the 
Lagrangian for the simplest system imaginable — a single particle moving in free space. 
Indeed, it seems difficult to introduce the concept of mass convincingly at all. Neverthe- 
less, this is the route that must be followed when Hamilton’s principle is extended, for 
example, to particle physics. The Lagrangian has to be found by intelligent guesswork, 
and, in particular, by taking account of all the symmetries that are known to exist. 

Within classical mechanics itself, it may appear that Hamilton’s principle has told us 
nothing new. It says that the motions of a mechanical system are the same as the extremals 
of the action functional, that is, the motions satisfy Lagrange’s equations; this we already 
knew. However, because the equations of motion have the special form associated with 
variational principles, they can be shown to possess important properties that would be 
very difficult to prove directly. One example of this is the effect on the equations of 
motion of choosing a new set of generalised coordinates q’ = (q\; G5: .++5G,). The q’ are 
known functions of the old generalised coordinates q and vice versa. The direct approach 
would be to subject the Lagrange equations (13.27) to this general transformation of the 
coordinates and see what happens; the result would be a complicated mess. However, in 
the variational approach, one simply expresses the Lagrangian L as a function of the new 
variables, that is, L = L(q’, q’,t). Although L has a different functional form in terms of 
the coordinates q’, its values are the same as before, so that the new action functional 


ty 
Slq’]= / L(q', q',t) dt, 
t 


0 


takes the same values as the old, provided that q’(t) and q(t) refer to the same trajectory of 
the mechanical system. It follows that, if the trajectory g(t) makes S[q] stationary, then 
the corresponding trajectory g'(t) makes S[q’] stationary. Hence the extremals of S[q] 
map into the extremals of S[q'], and vice versa. It follows that the transformed equations 
of motion are just the same as the old ones with q replaced by q’. This fact is expressed 


* Actually, instead of the Second and Third Laws. The First Law is needed to ensure that the motion is 
observed from an inertial reference frame. 
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by saying that the Lagrange equations of motion are invariant under transformations 
of the generalised coordinates. This remarkable result clearly applies to any system of 
equations derived from a variational principle. This provides a general way of ensuring 
that any proposed set of governing equations should be invariant under a particular group 
of transformations (the Lorentz transformations, for instance). This will be so if the equa- 
tions are derivable from a variational principle whose ‘Lagrangian’ is invariant under the 
same group of transformations. 


Problems on Chapter 13 


Answers and comments are at the end of the book. 


Harder problems carry a star (). 


Euler-Lagrange equation 


13.1 Find the extremal of the functional 


2 42 
r= f — dt 
1 ¢ 


that satisfies x(1) = 3 and x(2) = 18. Show that this extremal provides the global minimum 
of J. 


13.2 Find the extremal of the functional 
a 
J[y] = / (2x sint — x7) dt 
0 
that satisfies x(0) = x (zt) = 0. Show that this extremal provides the global maximum of J. 


13.3 Find the extremal of the path length functional 


1 ty 2791/2 
Lly]= / 1+ (2) dx 
0 Xx 


that satisfies y(0) = y(1) = 0 and show that it does provide the global minimum for L. 


13.4 An aircraft flies in the (x, z)-plane from the point (—a, 0) to the point (a, 0). (z = 0 is 
ground level and the z-axis points vertically upwards.) The cost of flying the aircraft at height 
z is exp(—kz) per unit distance of flight, where k is a positive constant. Find the extremal for 
the problem of minimising the total cost of the journey. [Assume that ka < 1/2.] 


13.5% Geodesics on a cone Solve the problem of finding a shortest path over the surface 
of a cone of semi-angle a by the calculus of variations. Take the equation of the path in the 
form p = p(@), where p is distance from the vertex O and @ is the cylindrical polar angle 
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measured around the axis of the cone. Obtain the general expression for the path length and 
find the extremal that satisfies the end conditions p(—z/2) = p(m/2) =a. 

Verify that this extremal is the same as the shortest path that would be obtained by devel- 
oping the cone on to a plane. 


13.6 Cost functional A manufacturer wishes to minimise the cost functional 
4 
Clx] = / (G+ ii + 2x) ar 
0 


subject to the conditions x(0) = 0 and x(4) = X, where X is volume of goods to be produced. 
Find the extremal of C that satisfies the given conditions and prove that this function provides 
the global minimum of C. 

Why is this solution not applicable when X < 8? 


13.7 Soap film problem Consider the soap film problem for which it is required to minimise 


syi= f" »(14+3°)F as 


with y(—a) = y(a) = b. Show that the extremals of J have the form 
x 
y =ccosh (= +d) ; 
€ 


where c, d are constants, and that the end conditions are satisfied if (and only if) d = 0 and 


b 
cosha = (2) Xs 
a 


where A = a/c. Show that there are two admissible extremals provided that the aspect ratio 
b/a exceeds a certain critical value and none if b/a is less than this crirical value. Sketch a 
graph showing how this critical value is determined. 

The remainder of this question requires computer assistance. Show that the critical value 
of the aspect ratio b/a is about 1.51. Choose a value of b/a larger than the critical value 
(b/a = 2 is suitable) and find the two values of A. Plot the two admissible extremals on the 
same graph. Which one looks like the actual shape of the soap film? Check your guess by 
perturbing each extremal by small admissible variations and finding the change in the value of 
the functional J[y ]. 


Fermat’s principle 


13.8 A sugar solution has a refractive index n that increases with the depth z according to the 
formula 


z\ 1/2 
n=no (1+ =) ; 
a 


where no and a are positive constants. A particular ray is horizontal when it passes through 
the origin of coordinates. Show that the path of the ray is not the straight line z = 0 but the 
parabola z = x?/4a. 
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13.9 Consider the propagation of light rays in an axially symmetric medium, where, in a 
system of cylindrical polar co-ordinates (7, 6, z), the refractive index n = n(r) and the rays lie 
in the plane z = 0. Show that Fermat’s time functional has the form 


aT 1/2 
T[r]= Ee / n (-? + 2 dé, 
a) 


where r = 7(@) is the equation of the path, and7 means dr/dé@. 
(i) Show that the extremals of T satisfy the ODE 


nr? ait 
> = constant. 
(r2 + #2)1/2 
Show further that, if we write 7 = r tan yy, where y is the angle between the tangent to 


the ray and the local cylindrical surface , = constant, this equation becomes 
rncos yw = constant, 


which is the form of Snell’s law for this case. Deduce that circular rays with centre at 
the origin exist only when the refractive index n = a/r, where a is a positive constant. 


Hamilton’s principle 


13.10 A particle of mass 2 kg moves under uniform gravity along the z-axis, which points 
verically downwards. Show that (in SI units) the action functional for the time interval [0, 2] 
is 


S[z]= A (? r 202) dt, 


where g has been taken to be 10 ms~?. 


Show directly that, of all the functions z(t) that satisfy the end conditions z(0) = 0 and 
z(2) = 20, the actual motion z = 57? provides the /east value of S. 


13.11 A certain oscillator with generalised coordinate g has Lagrangian 
L= gq? _ 4q?. 


Verify that g* = sin 2t¢ is a motion of the oscillator, and show directly that it makes the action 
functional S[q ] satationary in any time interval [0, T]. 

For the time interval 0 < ¢t < z, find the variation in the action functional corresponding 
to the variations (i) h = € sin 4r, (ii) h = € sint, where € is a small parameter. Deduce that the 
motion g* = sin 2t does not make $ a minimum or a maximum. 


13.12 A particle is constrained to move over a smooth fixed surface under no forces other than 
the force of constraint. By using Hamilton’s principle and energy conservation, show that the 
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FIGURE 13.7 The path of quickest descent from P to 
Q in Cosine Valley. Those who lose their nerve at the 


summit can walk down by the shortest path (shown 
dashed). 


path of the particle must be a geodesic of the surface. (The term geodesic has been extended 
here to mean those paths that make the length functional stationary). 


This result has a counterpart in the theory of general relativity, where the concept of force does not exist 


and particles move along the geodesics of a curved space-time. 


13.13 By using Hamilton’s principle, show that, if the Lagrangian L(q, q, t) is modified to 
L' by any transformation of the form 


d t ; ; 
then the equations of motion are unchanged. 


Computer assisted problems 


13.14 Geodesics on a paraboloid Solve the problem of finding the shortest path between 
two points P(0, 1, 1) and Q(0, —1, 1) on the surface of the paraboloid z = x A y. 


Let C be a path lying in the surface that connects P and Q. Show that the length of C is given by 


iirl= i Ga me (1 +4r7) py" do, 
iat 


where r = r(Q) is the polar equation of the projection of C on to the plane z = 0, and 7 means dr/dé. 
Now find the function 7(@) that minimises L. It is easier to work directly with the second order E—L- 
equation (which can be found with computer assistance). Solve the E-L-equation numerically with the 
initial conditions (0) = A, (0) = 0 and choose A so that the path passes through P (and, by symmetry, 
Q). Plot the shortest path using 3D graphics. 
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13.15% The downhill skier Solve the problem of finding the path of quickest descent for 
a skier from the point P (xo, yo, Zo) to the point Q(x, y1, Z1) on a snow covered mountain 
whose profile is given by z = G(x, y), where G is a known function. [Assume that the skier 
starts from rest and that the total energy of the skier is conserved in the descent.] 


Let C be a path connecting P and Q. Show that the time taken to descend by this route is given by 


: : «\2\ 1/2 
m (1452+ (G.x+G,y3) 
Tiyl= ey? [ 1 ( +y + ( x + yy) dx 


x0 G(x0, yo) — G(x, y) 


where y = y(x) is the projection of C on to the plane z = 0, y means dy/dx, and G, x, G, y are the 
partial derivatives of G with respect to x, y. Obtain the E—L equation with computer assistance and solve it 
numerically with the initial conditions y(x1) = yy, y’(x,) = 4 and choose A so that the path passes through 
the P. [The numerical ODE integrator finds it easier to integrate the equation starting from the bottom. 
Why?] Plot the quickest route using 3D graphics. The author used the profile of the Cosine Valley resort, 
for which G(x, y) = cos? (x /2) cos” (zy/4). The skier had to descend from P (1/3, 0, 3/4) to Q(2, 2, 0). 
The computued quickest route down the valley is shown in Figure 13.7. Those who lose their nerve at the 
summit can walk down by the shortest route (shown dashed). You may make up your own mountain profile, 


but keep it simple. 


Chapter Fourteen 
SSS SS SSS SSS ane 


Hamilton’s equations and phase space 


KEY FEATURES 
The key features of this chapter are the equivalence of Lagrange’s equations and Hamilton’s 
equations, Hamiltonian phase space, Liouville’s theorem and recurrence. 


In this chapter we show how Lagrange’s equations can be reformulated as a set of first 
order differential equations known as Hamilton’s equations. Nothing new is added to the 
physics and the Hamilton formulation is not superior to that of Lagrange when it comes 
to problem solving. The value of Hamilton’s supremely elegant formulation lies in pro- 
viding a foundation for theoretical extensions both within and outside classical mechan- 
ics. Within classical mechanics, it is the basis for most further developments such as the 
Hamilton-Jacobi theory and chaos. Elsewhere, Hamiltonian mechanics provides the best 
route to statistical mechanics, and the notion of the Hamiltonian is at the heart of quantum 
mechanics. As applications of the Hamiltonian formulation, we prove Liouville’s theorem 
and Poincaré’s recurrence theorem and explore some of their remarkable consequences. 


14.1 SYSTEMS OF FIRST ORDER ODES 


The standard form for a system of first order ODEs in the 7 unknown functions 
xX] (t), ea); ae Xn(t) is 


xy = F\ (x4, x2, ee sXnat)s 

x2 — F2(x1, x2, eee | Xn t), 
: . (14.1) 

Xp = Fy(X1, X2, ia sdngt)s 
where F|, F2,..., Fy are given functions of the variables x1, x2, ..., Xn, ¢. This can also 

be written in the compact vector form 

x = F(x,f), (14.2) 
where x and F are the n-dimensional vectors x = (xX],.%,...,X,) and F = 
(F\, F2,..., Fy). If the value of x is given when t = fo, the equations x = F(x, fr) 


determine the unknowns x(f) at all subsequent times. 
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A typical example is the predator-prey system of equations 


Xx, = ax, — bx1Xx2, 


X2 = bx1x2 — cx2, 


which govern the population density x(t) of the prey and the population density x2(t) of 
the predator. In this case, Fy = ax, — bx 1x2, and F7 = bx,x2 — CX. 

Converting higher order equations to first order 

Higher order ODEs can always be converted into equivalent systems of first order ODEs. 
For example, consider the damped oscillator equation 


X+3x+4x =0. (14.3) 


If we introduce the new variable v, defined by v = x, then the second order equation 
(14.3) for x(t) can be converted into the pair of first order equations 


x=Uv 


v= —3x — 40, 


in the unknowns {x(t), v(t)}. Since this step is reversible, this pair of first order equations 
is equivalent to the original second order equation (14.3). More generally: 


Any system of n second order ODEs in n unknowns can be converted into an equivalent 
system of 2n first order ODEs in 2n unknowns. 


Consider, for example, the orbit equations for a particle of mass m attracted by the grav- 
itation of a mass M fixed at the origin O. In terms of polar coordinates centred on O, the 
Lagrangian is 


: MG 
= sm G +176?) + ein 
1 
and the corresponding Lagrange equations are 
. MG : ss 
i—r0 = as 2rr6 +r76 = 0, 
z 


a pair of second order ODEs in the unknowns {7 (¢), @(¢)}. If we now introduce the new 
variables v, and vg, defined by 


Ur =F, up = 6, 
the two second order Lagrange equations can be converted into 


r= 0y, v= rvs — MG/r?, 
6= ve, vo = —2v;v9/T, 


an equivalent system of four first order ODEs in the four unknowns {r, 0, v;, ve}. 
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Hamilton form 


In the above examples, we have performed the conversion to first order equations by intro- 
ducing the coordinate velocities as new variables. This is not the only choice however. 
When transforming a system of Lagrange equations to a first order system, we could 
instead take the congugate momenta, defined by 


OL 


=—, (14.4) 
0g j 


Pj 
as the new variables. This seems quite attractive since the Lagrange equations already 
have the form 


_ ab 


= >— (i<j sn). 
0qj 


Pj 
The downside is that the right sides 0L./0qj; are functions of g, q, t, and must be trans- 
formed to functions of q, p, t. This is achieved by inverting the equations (14.4) to express 
the q as functions of q, p, ft. 
For the Lagrangian in the orbit problem, the conjugate momenta are 
OL : OL 


22 
= —_—= Mr, = — =mr’é, 
Pr : Po 96 


and these equations are easily inverted” to give 


. : ; 6 
— Pr A 0 => ae 2° 
m mr 


The two second order Lagrange equations for the orbit problem are therefore equivalent 
to the system of four first order ODEs 


_ Pr —- Pa 7 mMG 
Mm? "m3 r2? 
6 — eae Po = 0, 
mr 


in the four unknowns {r, 6, p,, po}. This is the Hamilton form of Lagrange’s equations 
for the orbit problem. 


Why bother? 


The reader is probably wondering what is the point of converting Lagrange’s equations 
into a system of first order ODEs. For the purpose of finding solutions to particular prob- 
lems, like the orbit problem above, there is no point. Indeed, the new system of first order 


* This step is difficult in the general case where there are n coupled linear equations to solve for |, ..., Gn- 
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equations may be harder to solve than the original second order equations. The real interest 
lies in the structure of the general theory. When Lagrange’s equations are expressed in 
Hamilton form* in a general manner, the result is a system of first order equations of 
great simplicity and elegance, now known as Hamilton’s equations. These equations are 
the foundation of further developments in analytical mechanics, such as the Hamilton- 
Jacobi theory and chaos. Also, the Hamiltonian function, which appears in Hamilton’s 
equations, is at the heart of quantum mechanics. 


14.2 LEGENDRE TRANSFORMS 


The general problem of converting Lagrange’s equations into Hamilton form 
hinges on the inversion of the equations that define p, namely, 


i ~ 14g,4.1) (ls jsn), (14.5) 
Og j 

so as to express q in terms of g, p, t. This inversion is made easier by the fact that 

the {p;} are not general functions of g, q and t, but are the first partial derivatives of a 

scalar function, the Lagrangian L(q, q, t). Itis a remarkable consequence that the inverse 

formulae can be written in a similar way.’ The details of the argument follow below and 

the results are summarised at the end of the section. 


The two-variable case 


We develop the transformation theory for the case of functions of two variables. This has 
all the important features of the general case but is much easier to follow. Suppose that v1 
and v2 are defined as functions of the variables uw; and u2 by the formulae 


OF OF 


= —, = —, 14.6 
Ou, = Ou2 ( ) 


UI 


where F(u1,u2) is a given function of uw; and wz. Is it possible to write the inverse 
formulae* in the form 


0G dG 
uj =>—, uz = >—, (14.7) 

dv 0v2 
* This is the form in which the new variables are the conjugate momenta pj, ..., Pn. The form in which the 
new variables are the generalised velocities 1, ..., Gn does not lead to an elegant theory, and is therefore 


not used. It is often claimed that it is not possible to take the generalised velocities as new variables 
because ‘they are the time derivatives of the generalised coordinates and therefore cannot be independent 
variables’. This objection is baseless, as the previous examples show. Indeed, if this objection had any 
substance, the conjugate momenta would be disqualified as well! 
* There is a neat way of seeing that this must be true, which may appeal to mathematicians. If v = 
grad, F (uw), then the Jacobian matrix of the transformation from u to v is symmetric. It follows that the 
Jacobian matrix of the inverse tranformation must also be symmetric, which is precisely the condition 
that the inverse transformation has the form u = grady G(v). 
We will always suppose that the inverse transformation does exist. 


He 
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for some function G(vj, v2)? In the simplest cases one can answer the question by grind- 
ing through the details directly. For example, suppose F = 2ut + 3uju2 + ue Then 


vy = 4u; + 3u2, 
v2 = 3u, + 2u2. 


The inverse formulae are easily obtained by solving these equations for u;, v2, which 
gives 


uy = —2v, + 3v2, 


u2 = 3v, — 4v2. 


There is no prior reason to expect that these formulae for u1, u2 can be expressed in 
terms of a single function G(v1, v2) in the form (14.7), but it is true because the right 
sides of these equations happen to satisfy the necessary consistency condition.* Simple 
integration then shows that (to within a constant) 


G= ye + 3u,v2 — Que. 


This result is not a coincidence. Let F (u;, u2) now be any function of the variables u1, 
u2, and suppose that a function G(v1, v2) satisfying equation (14.7) does exist. Consider 
the expression 


X = F(uj,u2) +G(vy, v2) — (uj vy + u202), 


which, as it stands, is a function of the four independent variables uj, u2, vj, v2. Suppose 
now that, in this formula, we imagine’ that v; and v2 are replaced by their expressions in 
terms of uw; and uz. Then X becomes a function of the variables uw; and 2 only. Its partial 
derivative with respect to u;, holding u2 constant, is then given by 


ox OF & dv; OG | ( dv} =) 


x x — — 
Ouy uy dv, Ou, OUn——s«CODdY el HO 


oF i dG dv} 4 dG dv2 
=|{——-v u u 
Ouy i dv : Ou dv2 7 Ou 


=0+0+0=0, 


* Tfuy = f1 (vq, v2) and uz = fo(v1, v2) then it is possible to express uj, uz in the form (14.7) only if the 
functions f; and f» are related by the formula 


afi _ af 
dv2 avy” 


which is called the consistency condition. 

+ Pure mathematicians strongly object to such feats of imagination. Unfortunately, the alternative is to 
introduce a welter of functional notation which obscures the essential simplicity of the argument. We 
will make frequent use of such ‘imagined’ substitutions. 
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on using first the chain rule and then the formulae (14.6) and (14.7). Hence X is indepen- 
dent of the variable u;. In exactly the same way we may show that 0X /du2 = 0 so that 
X is also independent of w2. It follows that X must be a constant! This constant can be 
absorbed into the function G without disturbing the formulae (14.7), in which case X = 0. 
We have therefore shown that if F and G are related by the equations (14.6) and (14.7), 
then they must* satisfy the relation 


F(uy,u2) + Gv, v2) = uy vy + u202. (14.8) 


The above argument is reversible so the converse result is also true. We have therefore 
shown that: 


The required function G(v1, v2) always exists and can be generated from the function 
F (uy, u2) by the formula 


G(vq, v2) = (ujvy + u2v2) — F(uy, U2) (14.9) 


where u, and uz are to be replaced by their expressions in terms of v and v2. 


It is evident that the relationship between the functions F' and G is a symmetrical one. 
Each function is said to be the Legendre transform of the other. 


Example 14.1 Finding a Legendre transform 


Find the Legendre transform of the function F (u;, u2) = 2ut +3u,;u2+ us by using 


the formula (14.9). 

Solution 

For this F, vy = 0F /du, = 4u, + 3u2 and v2 = OF /du2 = 3u, + 2u2. The inverse 
formulae are wy = —2v, + 3v2 and uz = 3v; — 4v2. From equation (14.9), the 


function G is given by 


G = u,v, + u2v2 — F (uy, u2) 
= (—2v, + 3v2)v1 + Bu, — 4u2)v2 — F(—2v1 + 3v2, 3v1 — 42) 
= —20? + 6u1 v2 — 4u3 — 
(2(-201 + 3v2)? + 3(—2v; + 3v2)(3v1 — 4v2) + Bui — 4v2)*) 
= —v; + 3u, v2 — 2v3, 


the same as was obtained directly. This is the Legendre transform of the given 
function F’. & 


* As we have seen, this may require a constant to be added to the function G. 
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Active and passive variables 


The variables uw = (uj, U2) and v = (vj, v2) are called active variables because they are 
the ones that are actually transformed. However, the functions F and G may also depend 
on additional variables that are not part of the transformation as such, but have the status 
of parameters. These are called passive variables. In the dynamical problem, q and p are 
the active variables and g is the passive variable. We need to find how partial derivatives 
of F and G with respect to the passive variables are related. 

Suppose then that F = F(uj,u2,w) and G = G(v, v2, w) satisfy the formulae 
(14.6) and (14.7), where w is a passive variable. Then (14.6) defines vj, v2 as functions 
of v1, v2 and w, and (14.7) defines v1, v2 as functions of vj, v2 and w. The argument 
leading to the formula (14.8) still holds so that 


F (uj, u2, w) + G(vq, v2, w) = uy vy + u202. (14.10) 


In this formula, imagine that v; and v2 are replaced by their expressions in terms of 1, 
uz and w; then differentiate the resulting identity with respect to w, holding u, and u2 
constant. On using the chain rule, this gives 

OF & dv, OG dv2 =< dv} dv2 


aE aoe ee i ee ee = uy —— + u2—, 
aw dv] dw dv2.—s OW dw w 


which can be written 


OF ¢ dG dG \ Ov ‘e dG \ dv2 04+0=0 

— — = ht >S — LS —— —— |; =U, 

dw ow : dv, / Ow dv2/ Ow 
on using the relations (14.7). Hence the partial derivatives of F(uj,u2,w) and 
G(v1, v2, w) with respect to w are related by 


—— (14.11) 


This is the required result; it holds for each passive variable w. 


The general case with many variables 


The preceding theory can be extended to any number of variables. The results are exactly 
what one would expect and are summarised in the box below. This summary is presented 
in a compact vector form using the n-dimensional ‘grad’.* It is a good idea to write these 
results out in expanded form. 


* If F = F(u, w), where u = (uy, u2,...,Un) and w = (wi, W2,..., Wm), then grad, F and grady F 
mean 


OF OF oF 
ouy ; ou2 ‘ : dun 


OF OF OF 
grady F(u, w) = 


dw, dw | Owm 


) ; grady F(u, w) = ( 
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Legendre transforms 


Suppose that the variables v = (v1, v2, ..., U,) are defined as functions of the active 
variables u = (U1, U2,..., Up) and passive variables w = (w 1, W2,..., Wm) by the 
formula 

v = grad, F(u, w), (14.12) 


where F is a given function of wu and w. Then the inverse formula can always be 
written in the form 


u = grady G(v, w), (14.13) 
where the function G(v, w) is related to the function F (uv, w) by the formula 
G(v,w)=u-v—F(u, w), (14.14) 
where w+ v = Uyvy +U2QU2 +--+ + UnUn. 
Furthermore, the derivatives of F and G with respect to the passive variables 
{w} are related by 


grady F(u, w) = — grady G(v, w). (14.15) 


The relationship between the functions F and G is symmetrical and each is said to 
be the Legendre transform of the other. 


14.3. HAMILTON’S EQUATIONS 


Let S be a Lagrangian mechanical system with n degrees of freedom and gener- 


alised coordinates g = (41, 91,.--;4n). Then the Lagrange equations of motion for S 
are 
= n), 4. 
dt \0qj qj == 


where L = L(q,q,t) is the Lagrangian of S. This is a set of n second order ODEs in 
the unknowns q(t) = (gi (t), g2(t), ---, Gn(t)). We now wish to convert these equations 
into Hamilton form, that is, an equivalent set of 27 first order ODEs in the 27 unknowns 
q(t), p(t), where p(t) = (pi(t), po(t),..-, Pn(t)), where the {p;} are the generalised 
momenta of S. The {p;} are defined by 


_ aL 


=— (d<j<n), (14.17) 
qj 


Pj 
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which can be written in the vector form 
p = gradg L(q,q,t). (14.18) 


The first step is to eliminate the coordinate velocities q from the Lagrange equations 
in favour of the momenta p. This in turn requires that the formula (14.18) must be inverted 
so as to express q in terms of qg, p and t. This is precisely what Legendre transforms do. 
It follows from the theory of the last section that the inverse formula to (14.18) can be 
written in the form 


q = grad y H(q, pt), (14.19) 


where the function H(q, p,t) is the Legendre transform of the Lagrangian function 
L(q,q,t). Here, g and p are the active variables and q is the passive variable. 


Definition 14.1 Hamiltonian function The function H (q, p, t), which is the Legendre 
transform of the Lagrangian function L(q,q,t), is called the Hamiltonian function of 
the system S. 


Since the functions H and L are Legendre transforms of each other, they satisfy the 


H(q, p.t)=q-p—L(q.q.t) (14.20) 


which can be used to generate H from L, and 


relations 


gradg L(q, 9,1) = — gradg H(q, p, t), (14.21) 


which connects the derivatives of L and H with respect to the passive variables. 
It is now quite easy to perform the transformation of Lagrange’s equations. The 
Lagrange equations (14.16) can be written in terms of the generalised momenta {p ;} in 


the form 
; OL Gere 
Pj =>. SJ Ss 
ia dj 
which is equivalent to the vector form 
p= gradg L(q, q,t). (14.22) 


The right sides of these equations still involve qg, but, on using the formula (14.21), we 
obtain 


p=— gtadg H(q, p,t). (14.23) 


These are the transformed Lagrange equations! The Hamilton form of the Lagrange equa- 
tions therefore consists of equations (14.23) together with equations (14.19), which effec- 
tively define the generalised momentum p. 
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All of the above argument is reversible and so the Hamilton form and the Lagrange 
form are equivalent. Our results are summarised below: 


Hamilton’s equations 


The n Lagrange equations (14.16) are equivalent to the system of 2n first order 
ODEs 


y= grad p H(q, p,t), p=- gradg H(q, p,t), (14.24) 


where the Hamiltonian function H(q, p,t) is the Legendre transform of the 
Lagrangian L(q, q,t) and is generated by the formula (14.20). This is the vector 
form of Hamilton’s equations.* The expanded form is 


oH oH 


He 


We have shownthat the n second order Lagrange equations in the 7 unknowns q(t) 
are mathematically equivalent to the 2” first order Hamilton equations in the 2n unknowns 
q(t), p(t). In each of these formulations of mechanics, the motion of the system is deter- 
mined by the form of a single function, the Lagrangian L(q, q, t) in the Lagrange formu- 
lation, and the Hamiltonian H(q, p,t) in the Hamilton formulation. Hamilton’s equa- 
tions are a particularly elegant first order system in which the functions F), F2,... that 
appear on the right are simply the first partial derivatives of a single function, the Hamil- 
tonian H. Moreover these right hand sides also satisfy the special condition’ div F = 0, 
which allows Liouville’s theorem to be applied to Hamiltonian mechanics.* 


Explicit time dependence 


One final note. When the Lagrangian has an explicit time dependence, this ¢ has the status 
of an extra passive variable. It follows that we then have the additional relation 


aoe (14.25) 


* After Sir William Rowan Hamilton, whose paper Second Essay on a General Method in Dynamics was 
published in 1835. Hamilton’s equations are sometimes called the canonical equations; no one seems to 
know the reason why. 

+ The scalar quantity div F is defined by 


. OF, OF) OF y 
div F = ——- + —+}.-. 
Ox 0x2 OXn 


* None of these statements is true if Lagrange’s equations are expressed as a first order system by taking 
the coordinate velocities q as the new variables. 
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which shows that if either of L or H has an explicit time dependence, then so does the 
other. 


Example 14.2 Finding a Hamiltonian and Hamilton’s equations 


Find the Hamiltonian and Hamilton’s equations for the simple pendulum. 


Solution 
The Lagrangian for the simple pendulum is 
292 


| sma +mgacos6, 


where @ is the angle between the string and the downward vertical, m is the mass of 
the bob, and a is the string length. The momentum pg conjugate to the coordinate 0 
is given by 


rr mao 
00 


Pe 


and this formula is easily inverted to give 


p=. (14.26) 
ma 
The Hamiltonian H is then given by 
H =6po—-L, 


where 6 is given by equation (14.26). This gives 


2 
H= (=) Pe - 5ma* (=) — mgacosé 
maz maz 
Pp 


~ Ima2 


—mgacosé. 


This is the Hamiltonian for the simple pendulum. From H we can find Hamilton’s 
equations. They are 


0H Pe 
6 = — = —., 
Ope maz 
‘ oH in 
= -— = —mega sing. 
Pe 36 & 


These are Hamilton’s equations for the simple pendulum. 

This simple example illustrates clearly why Lagrange’s equations are preferred 
over Hamilton’s equations for the practical solution of problems. To solve Hamilton’s 
equations in this case, we would differentiate the first equation with respect to ¢ and 
then use the second equation to eliminate the unknown pg. This gives 


6+ = sing =0, 
a 


which is precisely the Lagrange equation for the system! 
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Properties of the Hamiltonian H 


The Hamiltonian function H(q, p,t) has been defined as the Legendre transform of 
L(q,q,t) and, as such, it can be generated by the formula (14.20). We have met this 
expression before. It is identical to the energy function / 


n 
h=)°4;pj -L@.4.0), (14.27) 
j=l 
defined in section 12.8. The only difference between / and H is that the functional form 
of H is vital. H must be be expressed in terms of the variables q, p, t. On the other hand, 
since only the values taken by / are significant, its functional form is unimportant and it 
may be expressed in terms of any variables. However, since the values taken by H and h 
are the same, the results that we obtained in section 12.8 concerning / must also be true 
for the Hamiltonian H. In particular, when H has no explicit time dependence, H is a 
constant of the motion.* This result can also be proved independently, as follows. 
Suppose that H = H(q, p) and that {q(¢), p(t)} is a motion of the system. Then, in 
this motion, 


n 


dH (aH "\ 0H dH 
“ER Ge) SiC) 
— 9q; \ OP; Opj \ oq; 


j=l 


where the first step follows from the chain rule and the second from Hamilton’s equations. 
Hence H remains constant in the motion. 

Systems for which H = H(q, p) are said to be autonomous. (This term was previ- 
ously applied to systems for which L = L(q, q), but equation (14.25) shows that these 
two classes of systems are the same.) The above result can therefore be expressed in the 
form: 


Autonomous systems conserve H 


In any motion of an autonomous system, the Hamiltonian H (q, p) is conserved. 


In addition, when S is a conservative standard system, the Hamiltonian H can be 
expressed in the simpler form 


H(q, p)=T(q, p)+V@) (14.28) 


* As we remarked earlier, H has an explicit time dependence when L does; the circumstances under which 
this occurs are listed in section 12.6. 
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where T (q, p) is the kinetic energy of the system expressed in terms of the variables q, p. 
In this case, H is simply the total energy of the system, expressed in terms of the variables 
q, p. This is the quickest way of finding H when the system is conservative. 


Example 14.3. Finding a Hamiltonian 2 


Find the Hamiltonian for the inverse square orbit problem considered earlier and 
deduce Hamilton’s equations for this system. 


Solution 


This is a conservative system so that H = T + V. With the polar coordinates r and 
@ as generalised coordinates, T and V are given by 


: MG 
T= sm (? +176?) —— ae ; 
- 
and the generalised momenta are given by 
aL . aL 2: 
Pr = =m, Pe = — =r. 


OF a6 
These equations are easily inverted to give 


Pr 6 Po 


H 


II 
laa | 
+ 
x 
II 


This is the required Hamiltonian. Hamilton’s equations are now found by using this 
Hamiltonian in the general equations (14.25). The partial derivatives of H are 


dH py dH pe dH Po aH _ 4 
apr om’ dpe mr2’ drm}? 00 


and Hamilton’s equations for the orbit problem are therefore 


_ Pr Le PA _ mMG 
~~ m? _ mr3 ro” 
6 = Pe Po = 0. 
mr 


Naturally, these are the same equations as were obtained earlier by ‘manual’ trans- 
formation of Lagrange’s equations. As in the last example, solution of the Hamilton 
equations by eliminating the momenta simply leads back to Lagrange’s equations. 
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Momentum conservation 
From the Hamilton equation p; = —0H/0q;, it follows that: 


If 0H /dq; = 0 (that is, if the coordinate qj; is absent from the Hamiltonian), then the 
generalised momentum pj is constant in any motion. 


The corresponding result in the Lagrangian formulation is that: 


If OL /0qj; = 0 (that is, if the coordinate qj; is absent from the Lagrangian), then the 
generalised momentum pj is constant in any motion. 


These two results seem slightly different, but they are equivalent, since, from equation 
(14.21), 0H/dqg; = —dL/0q;. This means that the term cyclic coordinate, by which 
we previously meant a coordinate that did not appear in the Lagrangian, can be applied 
without ambiguity to mean that the coordinate does not appear in the Hamiltonian. Our 
result is then: 


Conservation of momentum 


If g; is a cyclic coordinate (in the sense that it does not appear in the Hamiltonian), 
then p;, the generalised momentum conjugate to q;, is constant in any motion. 


14.4 HAMILTONIAN PHASE SPACE ((q, p)—space) 


Suppose the mechanical system S has generalised coordinates g, conjugate 
momenta p, and Hamiltonian H(q, p,t). If the initial values of g and p are known,* 
then the subsequent motion of S, described by the functions {qg(t), p(t)}, is uniquely 
determined by Hamilton’s equations. This motion can be represented geometrically by 
the motion of a ‘point’ (called a phase point) in Hamiltonian phase space. Hamilto- 
nian phase space is a real space of 2n dimensions in which a ‘point’ is a set of values 
(91, 92,+++s9n> P1, P2,+++s Pn) Of the independent variables {g, p}. (Note that a point in 
Hamiltonian phase space represents not only the configuration of the system S but also its 
instantaneous momenta. This is the distinction between Hamiltonian phase space, which 
has 2n dimensions, and Lagrangian configuration space, which has n dimensions.) Each 
motion of the system S then corresponds to the motion of a phase point through the phase 
space." 

The only case in which we can actually draw the phase space is when S has just one 
degree of freedom. Then the phase space is two-dimensional and can be drawn on paper. 


* We are more familiar with initial conditions in which q and q are prescribed. However, these conditions 
are equivalent to those in which the initial values of g and p are prescribed. 

* Tt should be noted that Hamiltonian phase space is generally not the same as the phase space introduced 
in Chapter 8, which, in the present notation, is (g, q)-space. In particular, our next result (Liouville’s 
theorem) does not apply in (qg, q)-space. 
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FIGURE 14.1 Typical paths in the phase space (q, p) corresponding to 
motions of a system S with Hamiltonian H = p* + q?/9. The arrows 
show the direction that the phase point moves along each path as ¢ 
increases. 


Example 14.4 Paths in phase space 


Suppose that S has the single coordinate g and Lagrangian 
+2 2 
L= a ae 
4 9 
Find the paths in Hamiltonian phase space that correspond to the motions of S. 


Solution 
The conjugate momentum p = 0L/dq = 54, and the Hamiltonian is 
I @? @? 
H =4p-L=(Qp)p-7Qp) +> =P +5: 


Hamilton’s equations for S are therefore 
q = 2p, p = —2q/9 
On eliminating p, we find that g satisfies the SHM equation 
gq + (4¢/9) =0. 
The general solution of the Hamilton equations for S is therefore 
q = 3Acos((2t/3) +a), p = —Asin((2t/3) + a), 


where A and @ are arbitrary constants. These are the parametric equations of the 
paths in phase space, the parameter being the time f; each path corresponds to a 
possible motion of the system S. Some typical paths are shown in Figure 14.1. For 
this system , every motion is periodic so that the paths are closed curves in the (g, p)- 
plane. (They are actually concentric similar ellipes.) The arrows show the direction 
that the phase point moves along each path as ¢ increases. Hl 
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Of course, most mechanical systems have more than one degree of freedom so that 
the corresponding phase space has dimension four or more and cannot be drawn. If the 
system consists of a mole of gas molecules, the dimension of the phase space is six times 
Avogadro’s number! Nevertheless, the notion of phase space is still valuable for we can 
still apply geometrical reasoning to spaces of higher dimension. This will not solve 
Hamilton’s equations of motion, but it does enable us to make valuable predictions about 
the nature of the motion. 


The phase fluid 


The paths of phase points have a simpler structure when the system is autonomous, that 
is, H = H(q, p). In this case, H is a constant of the motion, so that each phase path 
must lie on a ‘surface’* of constant energy’ within the phase space. Thus the phase space 
is filled with the non-intersecting level surfaces of H, like layers in a multi-dimensional 
onion, and each phase path is restricted to one of these level surfaces. 

For autonomous systems, there can only be one phase path passing through any point 
of the phase space. The reason is as follows: suppose that one phase point is at the point 
(Go, Po) at time f;, and another phase point is at (qo, Po) at time t2. Then, since H is 
independent of ¢, the second motion can be obtained from the first by simply making the 
substitution ¢ > t +f, — #2, a shift in the origin of time. Therefore the two phase points 
travel along the same path with the second point delayed relative to the first by the constant 
time f2 — tj. Hence phase paths cannot intersect. This means that the phase space is 
filled with non-intersecting phase paths like the streamlines of a fluid in steady flow. Each 
motion of the system S corresponds to a phase point moving along one of these paths, just 
as the real particles of a fluid move along the fluid streamlines. The 21-dimensional vector 
quantity u = (q, p) has the réle of the fluid velocity field u(r)* and Hamilton’s equations 
serve to specify what this velocity is at the point (q, p) of the phase space. Because of 
this analogy with fluid mechanics, the motion of phase points in phase space is called the 
phase flow. 


14.5 LIOUVILLE’S THEOREM AND RECURRENCE 


Consider those phase points that, at some instant, occupy the region Ro of the 
phase space, as shown in Figure 14.2. As ¢ increases, these points move along their 
various phase paths in accordance with Hamilton’s equations and, after time f, will occupy 
some new region R, of the phase space. This new region will have a different shape® 


* 


If the phase space has dimension six, then a ‘surface’ of constant H has dimension five. This is therefore 
a generalisation of the notion of surface, which normally has dimension two. 

This ‘energy’ is the generalised energy H (q, p). For a conservative system, it is equal to the actual total 
energy 7+ V. 

u(r) is the velocity of the fluid particle instantaneously at the point with position vector r. 

3 Since the motion of many systems is sensitive to the initial conditions, the shape of ; can become very 
weird indeed! 


+ 
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FIGURE 14.2 Liouville’s theorem: the Hamiltonian phase 
flow preserves volume. 


to Ro, but Liouville’s theorem” states that the volumes’ of the two regions are equal. 
This remarkable result is expressed by saying that the Hamiltonian phase flow preserves 
volume. The theorem is easy to apply, but the proof is rather difficult. 


Proof of Liouville’s theorem 

The proof is easier to follow if we use x1, x2, ..., 2, as the names of the variables (instead of g, p), and 
also call the right sides of Hamilton’s equations F,, F>,..., F,. Then, in vector notation, the equations of 
motion are x = F(x, t). We will give the details for the case when the phase space is two-dimensional; the 
method in the general case is the same but uglier. 

Consider a set of phase points moving in the (x1, x2)-plane, which, at some instant in time, occupies 
the region Ro, as shown in Figure 14.2. Without losing generality, we may suppose that this occurs at time 
t = 0. After time f, a typical point x of Rp has moved on to position X = X(x, t) and the set as a whole 
now occupies the region 7?;. In this two-dimensional case, the ‘volume’ v(t) of 72; is the area of this region 
in the (x1, x2)-plane. Now 


w= | ax idx = | J dxjdxp, 
Rr Ro 


where J is the Jacobian of the transformation X = X (x, t), that is, 


0X 1/d0x, 0X 1/0Xx2 
J= ‘ (14.29) 
dX2/dx, 0X2/dx2 


Now, for small t, X may be approximated by 


X (x, ft) 


ox 2 
X(x,0) +t ape O(t) 


x+tF(x,0)+ O(t?), 


on using the equation of motion x = F(x, t). The corresponding approximation for J is 


+ a + O(t?) =14+1tdiv F(x, 0) + O(t?). 
4 t=0 


* After the French mathematician Joseph Liouville (1809-1882). 
¥ Since the dimension of the phase space can be any (even) number, this is a generalisation of the notion of 
volume. 
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FIGURE 14.3 An instance of Liouville’s theorem with the 
Hamiltonian H = p* + q?/9. The shaded region moves 
through the phase space. Its shape changes but its area 
remains the same. 


Hence the volume of R; is approximated by 
= / (1 +t div F(x, 0)) dxydx + O(t?) 
Ro 


when f is small. It follows that 


dv 
dt 


= lim Gane) =H div F(x, 0) dxydxy. 
t=<0 t-0 t Ro 


Finally, since the initial instant t = 0 was arbitrarily chosen, this result must apply for any f, that is, 
dv . 
—_—= div F(x, t) dxjdx2 
dt Ri 


at any time ¢. We see that, for general systems of equations, the phase flow does not preserve volume. 
However, if div F(x,t) = 0, then volume is preserved. For the case of Hamilton’s equations with one 
degree of freedom, 


dF, dF 
Ox 0x2 


es _3 aH) 9 
dq \ dp ap\ ag) 


Hence the Hamiltonian phase flow satisfies the condition div F = 0 and so preserves volume. This com- 


div F = 


pletes the proof. Hi 


Liouville’s theorem 


The motions of a Hamiltonian system preserve volume in (q, p)-space. 


A particular instance of Liouville’s theorem is shown in Figure 14.3. The phase paths 
of the Hamiltonian H = p? +.q7/9, shown in Figure 14.1, are concentric similar ellipses. 
Figure 14.3 shows the progress of a region of the phase space lying between two such 
elliptical paths. The region changes shape but its area remains the same. 

Liouville’s theorem has many applications and is particularly important in statistical 
mechanics. The following is a simple example. 
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Example 14.5 No limit cycles in Hamiltonian mechanics 


In the theory of dynamical systems, a periodic solution is said to be an asymptotically 
stable limit cycle if it ‘attracts’ points in nearby volumes of the phase space (see 
Chapter 8). Show that limit cycles cannot occur in the dynamics of Hamiltonian 
systems. 


Solution 


Suppose there were a closed path C in the phase space that attracts points in a nearby 
region R. Then eventually the points that lay in ® must lie in a narrow ‘tube’ of 
arbitrarily small ‘radius’ enclosing the path C. The volume of this tube tends to zero 
with increasing time so that the original volume of 7? cannot be be preserved. This 
is contrary to Liouville’s theorem and so asymptotically stable limit cycles cannot 
exist. i 


Poincaré’s theorem and recurrence 


Many Hamiltonian systems have the property that each path is confined to some bounded 
region within the phase space. Typically this is a consequence of energy conservation, 
where the energy surfaces happen to be bounded. Liouville’s theorem has startling impli- 
cations concerning the motion of such systems. First, we need to prove a result known as 
Poincaré’s recurrence theorem. Poincaré’s theorem is actually a result from ergodic the- 
ory and has many applications outside classical mechanics. However, since we are going 
to apply it to phase space, we will prove it in that context. 


Theorem 14.1 Poincaré’s recurrence theorem Let S be an autonomous Hamilto- 
nian system and consider the motion of the phase points that initially lie in a bounded 
region Ro of the phase space. If the paths of all of these points lie within a fixed bounded 
region of phase space for all time, then some of the points must eventually return to Ro. 


Proof. Let R, be the region occupied by the points after time t. (We will suppose that 721 
does not overlap Ro so that all the points that lay in 7o at time f = O have left 7o at time 
t = tT.) We must show that some of them eventually return to Ro. Let R2, R3, ..., Rn be the 
regions occupied by the same points after times 2T, 3t,..., nt. By Liouville’s theorem, all of 
these regions have the same volume. Therefore, if they never overlap, their total volume will 
increase without limit. But, by assumption, all these regions lie within some finite volume, so 
that eventually one of them must overlap a previous one. This much is obvious, but we must 
now show that an overlap takes place with the original region Ro. 

Suppose it is R,, that overlaps R, (0 < k < m). Each point of this overlap region 
corresponds to an intersection of the paths of two phase points that started out at some points 
X1, X2 of Ro at time ¢ = 0. In the same notation used in the proof of Liouville’s theorem, it 
means that X(x,|,mt) = X(x2, kt). But, since the system is autonomous, the two solutions 
X(x1,¢) and X(x2,t) must therefore differ only by a shift (m — k)t in the origin of time. It 
follows that X(x,, (m — k)t) = X(x2,0) = x2. Thus the phase point that was at x; when 
t = Ois at x2 when t = (m — k)t. This phase point has therefore returned to Ro after time 
(m — k)t and this completes the proof. m 
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FIGURE 14.4 Consequences of Poincaré’s recurrence theorem. Left: the particle sliding 
inside the smooth irregular bowl will eventually almost reassume its initial state. Right: The 
mole of gas molecules, initially all in the left compartment will eventually all be found there 
again. 


Since the recurrence theorem holds for any sub-region of Ro, it follows that, through- 
out Ro, there are phase points that pass arbitrarily close to their original positions. Thus 
if the system S has one of these points as its initial state, then S will eventually become 
arbitrarily close to reassuming that state. Actually, such points are typical rather than 
exceptional. To show this requires a stronger version of Poincaré’s theorem* than we have 
proved here, namely: The path of almost every’ point in Ro passes arbitrarily close to its 
starting point. This implies the remarkable result that for amost every choice of the initial 
conditions, the system S becomes arbitrarily close to reassuming those conditions at later 
times. 

An example of this phenomenon is the motion of a single particle P sliding under 
gravity on the smooth inner surface of a bowl of some irregular shape z = F(x, y), as 
shown in Figure 14.4. This is an autonomous Hamiltonian system with two degrees of 
freedom. Take the Cartesian coordinates (x, y) of P to be generalised coordinates. Then 
the Lagrangian is given by 


L= 


NI- 


m (3? +5? +27) — Mgz 
= hm (#7 +5? + (Fyx%+F,y3)) _meF, 
where F, , = dF /dx and F, y = dF /dy. The conjugate momenta are 
Px =m (®+(Fyxt+F,yy)Fyx), Py =m (V+(Fyxt+F,y3)Fyy). 


Just because energy is conserved, it does not necessarily mean that Poincaré’s theorem 
applies. We must show that the energy surfaces in phase space are bounded. The proof of 
this is as follows: 


* See Walters [12]. 
* This means that the set of exceptional points has measure zero. For example, in a two-dimensional phase 
space, a curve has zero measure. 
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In this case, Ro is some bounded region of the phase space (x, y, px, Py). Suppose that zo is the maximum 
value of z and that Tg is the maximum kinetic energy associated with points of Ro. Then, by energy 
conservation, the maximum value of z in the subsequent motions cannot exceed z™* = zg + (To/mg). 
Hence, providing the bow] rises to at least this height, the motions are confined to values of (x, y) that satisfy 
F(x, y) < z™®*. Tt follows that both x and y are bounded in the subsequent motions. Also, if the lowest 
point of the bowl is at z = 0, the value of T in the subsequent motions cannot exceed T™** = Ty + mgzo. 
It follows that x and y are bounded in the subsequent motions and this implies that the same is true for px 
and py. Hence x, y, px, and py are all bounded in the subsequent motions. This means that that the paths 
of the phase points that lie in Rg when ¢ = 0 are confined to a bounded region of the phase space for all 


time. Poincaré’s theorem therefore applies. 


Hence, if the particle P is released from rest (say) at some point A on the surface of 
the bowl, then, whatever the shape of the bowl, P will become arbitrarily close to being 
at rest at A at later times. 

In the same way, it follows that if a compartment containing a mole of gas molecules 
is separated from an empty compartment by a partition, and the partition is suddenly 
punctured, then at (infinitely many!) later times the molecules will all be found in the 
first compartment again. This remarkable result, which seems to be in contradiction to 
the second law of thermodynamics, appears less paradoxical when one realises that ‘later 
times’ may mean 107° years later! 


Question Exceptional points 


How do you know that the initial conditions you have chosen do not correspond to an 
‘exceptional point’ for which Poincaré’s theorem does not hold? 


Answer 
You don’t know, but you would be very unlucky if this happened! 


Problems on Chapter 14 


Answers and comments are at the end of the book. 


Harder problems carry a star (*). 


Finding Hamiltonians 
14.1 Find the Legendre transform G(v1, v2, w) of the function 
F(uj,u2,w) = on; — 3uju2+ us + 3wu, 
where w is a passive variable. Verify that dF /dw = —dG/dw. 
14.2 A smooth wire has the form of the helix x = acos@, y = asin@, z = b@, where 
@ is a real parameter, and a, b are positive constants. The wire is fixed with the axis Oz 


pointing vertically upwards. A particle P of mass m can slide freely on the wire. Taking 6 as 
generalised coordinate, find the Hamiltonian and obtain Hamilton’s equations for this system. 
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14.3 Projectile Using Cartesian coordinates, find the Hamiltonian for a projectile of mass m 
moving under uniform gravity. Obtain Hamilton’s equations and identify any cyclic coordi- 
nates. 


14.4 Spherical pendulum The spherical pendulum is a particle of mass m attached to a fixed 
point by a light inextensible string of length a and moving under uniform gravity. It differs 
from the simple pendulum in that the motion is not restricted to lie in a vertical plane. Show 
that the Lagrangian is 


L = }ma? (# + sin? @ é) +mgacos6, 


where the polar angles 0, @ are shown in Figure 11.7. Find the Hamiltonian and obtain Hamil- 
ton’s equations. Identify any cyclic coordinates. 


14.5 The system shown in Figure 10.9 consists of two particles P; and P2 connected by a light 
inextensible string of length a. The particle P; is constrained to move along a fixed smooth 
horizontal rail, and the whole system moves under uniform gravity in the vertical plane through 
the rail. For the case in which the particles are of equal mass m, show that the Lagrangian is 


L=3m (2:7 + 2ax6 + 6?) + mga cos 8, 


where x and 6 are the coordinates shown in Figure 10.9. 
Find the Hamiltonian and verify that it satisfies the equations t = 0H/dp, and 6 = 
0H /dpe. [Messy algebra.] 


14.6 Pendulum with a shortening string A particle is suspended from a support by a light 
inextensible string which passes through a small fixed ring vertically below the support. The 
particle moves in a vertical plane with the string taut. At the same time, the support is made to 
move vertically having an upward displacement Z(t) at time ¢. The effect is that the particle 
oscillates like a simple pendulum whose string length at time ¢ is a— Z(t), where a is a positive 
constant. Show that the Lagrangian is 


L=im ((a =2)6? + 2?) +mg(a— Z)cos8@, 


where @ is the angle between the string and the downward vertical. 
Find the Hamiltonian and obtain Hamilton’s equations. Is H conserved? 


14.7 Charged particle in an electrodynamic field The Lagrangian for a particle with mass m 
and charge e moving in the general electrodynamic field {E(r, t), B(r, t)} is given in Cartesian 
coordinates by 


L(r,r,t) = smi-r—eg(r,t)t+er-A(r,t), 


where r = (x, y, z) and {@, A} are the electrodynamic potentials of field {E, B}. Show that 
the corresponding Hamiltonian is given by 


(PSEA ADGA) 


A(r, p,t)= oe 


, 
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where p = (px, Py, Px) are the generalised momenta conjugate to the coordinates (x, y, z). 
[Note that p is not the ordinary linear momentum of the particle.] Under what circumstances 
is H conserved? 


14.8 Relativistic Hamiltonian The relativistic Lagrangian for a particle of rest mass mop mov- 
ing along the x-axis under the potential field V (x) is given by 


L=moc?{1-(1 eye V 
=moc a (x). 


Show that the corresponding Hamiltonian is given by 


9\ 1/2 
H = moc? (1+( Fol ) —moc* + V(x), 


moc 
where p, is the generalised momentum conjugate to x. 


14.9 A variational principle for Hamilton’s equations Consider the functional 


ty 


Naw, por= | (HG, pt) +4-p— 4: p)at 


10 


of the 27 independent functions g1(t),...,¢n(t), pi(t),---, Pn(t). Show that the extremals 
of J satisfy Hamilton’s equations with Hamiltonian 17. 


Liouville’s theorem and recurrence 


14.10 In the theory of dynamical systems, a point is said to be an asymptotically stable equi- 
librium point if it ‘attracts’ points in a nearby volume of the phase space. Show that such 
points cannot occur in Hamiltonian dynamics. 


14.11 A one dimensional damped oscillator with coordinate q satisfies the equation g + 4g + 
3q = 0, which is equivalent to the first order system 


g =v, v= —3q —4v. 
Show that the area a(t) of any region of points moving in (g, v)-space has the time variation 
a(t) =a(O)e’. 
Does this result contradict Liouville’s theorem? 


14.12 Ensembles in statistical mechanics In statistical mechanics, a macroscopic property 
of a system S is calculated by averaging that property over a set, or ensemble, of points moving 
in the phase space of S. The number of ensemble points in any volume of phase space is rep- 
resented by a density function p(q, p,t). If the system is autonomous and in statistical equi- 
librium, it is required that, even though the ensemble points are moving (in accordance with 
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Hamilton’s equations), their density function should remain the same, that is, o = p(q, p). 
This places a restriction on possible choices for p(qg, p). Let Ro be any region of the phase 
space and suppose that, after time ¢, the points of Ro occupy the region R;. Explain why 
statistical equilibrium requires that 


/ pia. pydv= | pq, p)dv 
Ro Rt 


and show that the uniform density function p(q, p) = po satisfies this condition. [It can be 
proved that the above condition is also satisfied by any density function that is constant along 
the streamlines of the phase flow. ] 


14.13 Decide if the energy surfaces in phase space are bounded in the following cases: 

(i) The two-body gravitation problem with E < 0. 

(ii) The two-body gravitation problem viewed from the zero momentum frame and with 
E <0. 

(iii) The three-body gravitation problem viewed from the zero momentum frame and with 
E <0. Does the solar system have the recurrence property? 


Poisson brackets 


14.14 Poisson brackets Suppose that u(q, p) and v(q, p) are any two functions of position 
in the phase space (q, p) of a mechanical system S. Then the Poisson bracket [v, v] of u and 
v is defined by 


n 
du dv du dv 
[u, v] = gradg u- gradyv — grad yu: gradg v = ( )s 
q Pp P q » aq; 8p; Ap; Aq; 


j=l 


The algebraic behaviour of the Poisson bracket of two functions resembles that of the 
cross product U x V of two vectors or the commutator UV — VU of two matrices. The 
Poisson bracket of two functions is closely related to the commutator of the corresponding 
operators in quantum mechanics.” 


Prove the following properties of Poisson brackets. 


Algebraic properties 


[u,u] =0, [v,u] = —[u, v], [Ayuy +Agqu2,v] = Ay[uy,v] +A2[U2, v] 


[[u,v],w]+[[w,u],v]+[[v, w],u] =0. 


This last formula is called Jacobi’s identity. It is quite important, but there seems to be no way of proving 
it apart from crashing it out, which is very tedious. Unless you can invent a smart method, leave this one 
alone. 


* The commutator [U, V] of two quantum mechanical operators U, V corresponds to ifi[u, v], where h is 
the modified Planck constant, and [v, v] is the Poisson bracket of the corresponding classical variables 
Uu, Vv. 
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Fundamental Poisson brackets 
lgj,9x] =9, [pj, pe] =90, [gj, Pk | = Sjk, 


where 6 jx is the Kroneker delta. 
Hamilton’s equations 


Show that Hamilton’s equations for S can be written in the form 
qj =(q;, 41], pj =(p;, 411, (ls 7 2m) 


Constants of the motion 


(i) Show that the fotal time derivative of u(q, p) is given by 


du 
—=[u,H 
a lu, H | 


and deduce that u is a constant of the motion of S if, and only if, [u, H ] = 0. 


(ii) If wu and v are constants of the motion of S, show that the Poisson bracket [u, v ] is 
another constant of the motion. [Use Jacobi’s identity.] Does this mean that you can keep 
on finding more and more constants of the motion ? 


14.15 Integrable systems and chaos A mechanical system is said to be integrable if its 
equations of motion are soluble in the sense that they can be reduced to integrations. (You do 
not need to be able to evaluate the integrals in terms of standard functions.) A theorem due 
to Liouville states that any Hamiltonian system with n degrees of freedom is integrable if it 
has n independent constants of the motion, and all these quantities commute in the sense that 
all their mutual Poisson brackets are zero.* The qualitative behaviour of integrable Hamilto- 
nian systems is well investigated (see Goldstein [4]). In particular, no integrable Hamiltonian 
system can exhibit chaos. 

Use Liouville’s theorem to show that any autonomous system with n degrees of freedom 
and n — | cyclic coordinates must be integrable. 


Computer assisted problem 


14.16 The three body problem 


There is no general solution to the problem of determining the motion of three or more bodies moving under 
their mutual gravitation. Here we consider a restricted case of the three-body problem in which the mass of 
one of the bodies, P, is much smaller than that of the other two masses, which are called the primaries. In 
this case we neglect the effect of P on the primaries which therefore move in known fixed orbits. The body 
P moves in the time dependent, gravitational field of the primaries. 


* This result is really very surprising. A general system of first order ODEs in 2n variables needs 2n 
integrals in order to be integrable in the Liouville sense. Hamiltonian systems need only half that number. 
The theorem does not rule out the possibility that that there could be other classes of integrable systems. 
However, according to Arnold [2], every system that has ever been integrated is of the Liouville kind! 
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FIGURE 14.5 There is no such thing as a typical orbit in the three-body 
problem. The orbit shown corresponds to the initial conditions x = 0, y = 0, 
Px = 1.03, py = 0 and is viewed from axes rotating with the primaries. 


Suppose the primaries each have mass M and move under their mutual gravitation around 
a fixed circle of radius a, being at the opposite ends of a rotating diameter. The body P moves 
under the gravitational attraction of the primaries in the same plane as their circular orbit. 
Using Cartesian coordinates, write code to set up Hamilton’s equations for this system and 
solve them with general initial conditions. [Take M as the unit of mass, a as the unit of length, 
and take the unit of time so that the speed of the primaries is unity. With this choice of units, 
the gravitational constant G = 4.] 

By experimenting with different initial conditions, some very weird orbits can be found 
for P. It is interesting to plot these relative to fixed axes and also relative to axes rotating with 
the primaries, as in Figure 14.5. Some fascinating cases are shown by Acheson [] and you 
should be able to reproduce these. [Acheson used a different normalisation however and his 
initial data needs to be doubled to be used in your code.] 


Part Four 


FURTHER TOPICS 


CHAPTERS IN PART FOUR 


Chapter 15 The general theory of small oscillations 

Chapter 16 Vector angular velocity and rigid body kinematics 
Chapter 17 Rotating reference frames 

Chapter 18 Tensor algebra and the inertia tensor 


Chapter 19 Three dimensional rigid body motion 


Chapter Fifteen 
a | 


The general theory of 
small oscillations 


KEY FEATURES 

The key features of this chapter are the existence of small oscillations near a position of stable 
equilibrium and the matrix theory of normal modes. A simpler account of the basic principles 
is given in Chapter 5. 


Any mechanical system can perform oscillations in the neighbourhood of a position 
of stable equilibrium. These oscillations are an extremely important feature of the system 
whether they are intended to occur (as in a pendulum clock), or whether they are undesir- 
able (as in a suspension bridge!). Analogous oscillations occur in continuum mechanics 
and in quantum mechanics. Here we present the theory of such oscillations for conserva- 
tive systems under the assumption that the amplitude of the oscillations is small enough 
so that the linear approximation is adequate. A simpler account of the theory is given in 
Chapter 5. This treatment is restricted to systems with two degrees of freedom and does 
not make use of Lagrange’s equations. Although the material in the present chapter is 
self-contained, it is helpful to have solved a few simple normal mode problems before. 

The best way to develop the theory of small oscillations is to use Lagrange’s equations. 
We will show that it is possible to approximate the expressions for T and V from the 
start so that the linearized equations of motion are obtained immediately. The theory is 
presented in an elegant matrix form which enables us to make use of concepts from linear 
algebra, such as eigenvalues and eigenvectors. We prove that fundamental result that 
a system with n degrees of freedom always has n harmonic motions known as normal 
modes, whose frequencies are generally different. These normal frequencies are the 
most important characteristic of the oscillating system. One important application of the 
theory is to the internal vibrations of molecules. Although this should really be treated 
by quantum mechanics, the classical model is extremely valuable in making qualitative 
predictions and classifying the vibrational modes of the molecule. 


15.1 STABLE EQUILIBRIUM AND SMALL OSCILLATIONS 


Let S be a standard mechanical system with n degrees of freedom and with gen- 
eralised coordinates g = (q1,92,---,9n). Suppose also that S is conservative. Then the 
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motion of S is determined by the classical Lagrange equations of motion 


d ( 0T oT aV 
= (<j<n), (15.1) 
dt 0qj 0qj 0qj 


where T (q, q) and V (q) are the kinetic and potential energies of S. In particular, these 
equations determine the equilibrium positions of S. The point g in configuration space 
is an equilibrium position of S if (and only if) the constant function g = q satisfies the 
equations (15.1). For a standard system, T has the form 


n n 
T=) >> tin(Q) Gide (15.2) 
j=lk=l 
that is, a homogeneous quadratic form in the variables g1, g2,..., Gn, with coefficients 


depending on q. It follows that the left side of the j-th Lagrange equation has the form 
. lik . 
2» (Gt ‘ant taii) - OY (Fats). 
j=l k=1 


which takes the value zero when the constant function gq = g® is substituted in. It follows 
that g = q will satisfy the equations (15.1) if (and only if) 


ae 


(<j <a”) (15.3) 
aqj 


when g = gq). In other words, we have the result: 


Stationary points of V 


The equilibrium positions of a conservative system are the stationary points of its 
potential energy function V (q). 


Stable equilibrium 


The stability of equilibrium is most easily understood in terms of the motion of the phase 
point of S in the Hamilton phase space (q, p) (see Chapter 14). If g is an equilibrium 
point of S in configuration space, then (q©, 0) is the corresponding equilibrium point of 
S in phase space; if the phase point starts at (q, 0), then it remains there, its trajectory 
consisting of the single point (q, 0). 

Now consider phase paths that begin at points that lie inside the sphere $3 in phase 
space which has centre (q), 0) and radius 6. When 4 is small, this corresponds to starting 
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FIGURE 15.1 The circle Cs; must be chosen 
small enough so that all the phase paths that 
start within it remain within the given circle 
Ca. 


the system from a configuration close to the equilibrium configuration with a small kinetic 
energy. Let S, be the smallest sphere in phase space with centre (q, 0) that contains 
all the phase paths that begin within S3. 


Definition 15.1 Stable equilibrium If the radius A tends to zero as the radius 8 tends 
to zero, then the equilibrium point at (q©, 0) is said to be stable.* 


This means that, if S is given a small nudge from a configuration close to a position of 
stable equilibrium, then the subsequent motion of S (in configuration space) is restricted 
to a small neighbourhood of the equilibrium point. Thus any mechanical system can 
perform small motions near a position of stable equilibrium. These motions are generally 
called small oscillations. 

We know that the equilibrium positions of S correspond to the stationary points of 
the potential energy V (q), but we have yet to identify which of these points correspond to 
stable equilibrium. In fact it is quite easy to prove the following important result: 


Minimum points of V 


The minimum points of the potential energy function V (q) are positions of stable 
equilibrium of the system S. 


Proof. Without loss of generality, suppose that the minimum point of the function V (q) is at q = 0 and 
that V (0) = 0. Take any A > 0. Then we must show that we can find a sphere Ss in phase space such that 
all the paths that begin within it remain inside the sphere S,. This is illustrated in Figure 15.1 for the only 
case that can be drawn, namely, when the phase space is two-dimensional; in this case, the ‘spheres’ are 
circles. 

The result follows from energy conservation. Let T(q, p) be the kinetic energy of the system. The 
total energy is then 


E(q, p) =TQq. p)+V@). 


* In the dynamical systems literature, this is known as Liapunov stability. 
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(~—8) 


FIGURE 15.2 The double pendulum. Left: The generalised coordinates 6, @. Right: The 
velocity diagram. 


Since g = 0 is a minimum point of V (q) and T (q, p) is positive for p ¥ 0, it follows that (0, 0), the origin 
of phase space, is a minimum point of the function E(q, p); the value of E at the minimum point is zero. 
The value of E on the sphere S, must therefore be greater than some positive number E ,. On the other 
hand, by making the radius 5 small enough, it follows by continuity that the value of E within the sphere 
Ss can be made as close to E (0, 0) (= 0) as we wish; we can certainly make it less than EA. Consider now 
any phase path starting within the circle Cs. Then, by energy conservation, E is constant along this path 
and is /ess than E,. Such a path cannot cross the sphere Sq, for, if it did, the value of E at the crossing 
point would be greater than E,, which is not true. Hence, any phase path starting within the sphere Ss 
must remain within the sphere Sa and this completes the proof. 


Example 15.1 Stability of equilibrium 


Consider the double pendulum shown in Figure 15.2 which moves under uniform 
gravity. Show that the vertically downwards configuration is a position of stable equi- 
librium. 


Solution 

The system is assumed to move in a vertical plane through the suspension point O. 
In terms of the generalised coordinates 0, @ shown, the vertically downwards config- 
uration corresponds to the point 6 = ¢ = 0 in configuration space. The gravitational 
potential energy is given by 


V =(M+™m)gb(1 —cos@) +mgc(1 — cos @¢) 


so that OV /00 = (M+m)gbsin@ = 0 when 6 = ¢ = 0. The same is true for dV /d@ 
and so the point 6 = @ = 0 is a stationary point of the function V (6, @). It follows 
that the downwards configuration is a position of equilibrium. 

We could determine the nature of this stationary point by looking at the second 
derivatives of V(@,), but there is no need because it is evident that V(@,@) > 
V (0, 0) unless 6 = ¢ = 0. Thus (0, 0) is a minimum point of V and so the downwards 
configuration is a position of stable equilibrium. @ 
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15.2 THE APPROXIMATE FORMS OF 7 AND V 


Now that we know small oscillations can take place about any minimum point of 
V, we can go on to find approximate equations that govern such motions. The obvious (but 
not the best!) way of doing this is as follows: Take the example of the double pendulum. 
In this case, T and V are given (see Figure 15.2) by 


T =1M(o6)? + 4m ((66)? + (cb)? + 2(66) (cd) cos(6 — ¢)) (15.4) 
V =(M+m)gb(1 —cos@) + mgc(1 —cos¢@). (15.5) 


If these expressions are substituted into the Lagrange’s equations, we obtain (after some 
simplification) the exact equations of motion 


(M + m)b@ +mc cos(6 — b)d +mce sin(@ — )o? +(M+m)gsiné = 0, 
bcos(6 — o)6 +c$ — bsin(6 — $)6? + gsing = 0. 


This formidable pair of coupled, second order, non-linear ODEs govern the large oscilla- 
tions of the double pendulum. However, for small oscillations about 6 = ¢ = 0, these 
equations can be approximated by neglecting everything except linear terms in 6, @ and 
their time derivatives. On carrying out this approximation, the equations simplify dramat- 
ically to give 


(M +m)b6+mcé+(M+m)go =0, (15.6) 
b6+co+g¢o=0. (15.7) 


These are the linearised equations governing small oscillations of the double pendulum 
about the downward vertical. They are a pair of coupled, second order, linear ODEs with 
constant coefficients. An explicit solution is therefore possible. 

While the above method of finding the linearised equations of motion is perfectly 
correct, it is wasteful of effort and is also unsuitable when presenting the general theory. 
What we did was to obtain the exact expressions for T and V, derive the exact equations 
of motion, and then linearise. In the linearisation process, many of the terms we took 
pains to find were discarded. It makes far better sense to approximate the expressions 
for T and V from the start so that, when these approximations are used in Lagrange’s 
equations, the linearised equations of motion are produced immediately. The saving in 
labour is considerable and this is also a nice way to present the general theory. 

Consider the double pendulum for example. The exact expression for V is given by 
equation (15.5) and when @, ¢ are small, this is given approximately by 


V= 5(M +m)gb 0? + smgc gp ++: ; 


where the neglected terms have power four or higher. Similarly, when 6, ¢ and their time 
derivatives are small, T is given approximately by 


T = 3M (b6)? + 3m ((b8)? + (cb)? + 2006)(c4)(1 + -++)), 
= 5(M + m)b?6? + mbc 6g + 4mc*g? +++, 
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where the neglected terms have power four (or higher) in small quantities. If these approx- 
imate forms for T and V are now substituted into Lagrange’s equations, the linearised 
equations of motion (15.6), (15.7) are obtained immediately. [Check this.] This is clearly 
superior to our original method. 


The general approximate form of V 


In the general case, suppose that the potential energy V (q) of the system S has a minimum 
at q = 0 and that V (0) = 0. (If the minimum point of V is not at q = 0, it can always be 
made so by a simple change of coordinates.) Then, for g near 0, V (q) can be expanded as 
an (n-dimensional) Taylor series in the variables q1, g2, ..., Gn. For the special case when 
S has two degrees of freedom, this series has the form 


aVv aVv 
V (q1, 92) = V (0,0) + (Sa + ~~ «2) 
aq Oq1 


7 a°V hans av Pad ays. 

727 Za 1192 TF 7 er 

aq? : 0q19q2 aq5 ‘ 
where all partial derivatives of V are evaluated at the expansion point gj = qi = 0. 
Now V has been selected so that V (0,0) = 0. Also, since (0, 0) is a stationary point of 
V (41, 42), it follows that 0V/dg, = 0V/dq2 = O there. Thus the constant and linear 
terms are absent from the Taylor expansion of V. It follows that V can be approximated 
by 


VPP (gy, q2) = vi1g7 + 20129192 + 02293, 


where v11, V12, V22 are constants given by 


02V a2V a2V 
v1 = 70,0) v2 =7—— 0,0) v2 = —, (0,0) 
aq; dq19q2 0q5 


and the neglected terms have power three (or higher) in the small quantities q1, g2. The 
corresponding approximation to V (q) in the case when S has n-degrees of freedom is 


n n 


vq) = >>> vj ajae (15.8) 


j=lk=l 


where the {v ;,} are constants given by 


a2V 


80798 | a4 


Vik = Vij = 
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and the neglected terms have power three (or higher) in the small quantities g1, 2, ..., 
qn. This is the general form of the approximate potential energy V*??(q). It is a homo- 
geneous quadratic form in the variables q1, g2,..-, Gn- 

In the theory that follows, we will always assume that gq = 0 is also a minimum point 
of the approximate potential energy V*PP(qg).* This condition is equivalent to requiring 
that the quadratic form (15.8) should be positive definite. This simply means that it takes 
positive values except when q = 0. 


The general approximate form of T 


For any standard mechanical system with generalised coordinates q, the kinetic energy T 
has the form 


n n 


T(q.4) = >_> tik (Q) Gide: 


j=lk=l 


a quadratic form in the variables q1, 2, ..., Gn with coefficients that depend on q. If 
we expand each of these coefficients as a Taylor series about g = 0, the constant term is 
simply (0) and 
n n 
P=) tixO)Qjdxk + -:-. 
j=lk=l 


It follows that T can be approximated by 


n 


n 
TAPP = ND tik I IG (15.9) 


jelk=l 


where the constants {t;<} are what we previously called {t;;(0)}, and the neglected terms 
have power three (or higher) in the small quantities ¢1, 92, -.., Gn. Vis G25 «++» Gn» This 
is the general form of the approximate kinetic energy V*??(q). It is a homogeneous 
quadratic form in the variables g1, g2, ..., Gn. Since T (q, q) > 0 except when qg = 9, it 
follows that the quadratic form (15.9) must also be positive definite. 


Useful tip: Since T*?P(q) = T (0, q), it follows that T*PP can be be found directly by 
calculating T when the system is passing through the equilibrium position; the general 
formula for T need never be found! 


* Tt might appear that this follows the fact that g = 0 is known to be a minimum point of the exact V, but 
this is not necessarily so. For example, if V (qj, q2) = qt + qs. then V@PP = a which does not have 
a strict minimum at gq; = g2 = 0. The general theory of small oscillations does not apply to such cases, 
and we exclude them. 
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The V-matrix and the 7-matrix 


In order to express the general theory concisely, we introduce the n x n matrices V and T 
as follows: 


Definition 15.2 The V-matrix and the T-matrix The symmetric n x n matrix V whose 
elements are the coefficients {v jx} that appear in the formula (15.8) is called the V -matrix. 
The symmetric n x n matrix T whose elements are the coefficients {t |x} that appear in the 
formula (15.9) is called the T -matrix. 


In terms of V and T, the approximate potential and kinetic energies of S can be written 
in compact matrix notation:* 


Quadratic forms for V2PP and 7 @PP 


n n 


a Soo viK aide =q-V-q 
f= (15.10) 
te Soyo tik aide =q'-T-q 


j=lk=l 


where q is the column vector with elements {g ;}, and q is the column vector with elements 
{qj}. 


Example 15.2 Finding V and T for the double pendulum 


Find the matrices V and T for the double pendulum. 


Solution 
For the double pendulum, V*?? is given by 


vere — 5(M +m)gb 6? + 4mgc ¢’, 
1 
L(M+m)gb 0 ) (3) 
= (0 2 
( 6) ( 0 smgc p 
and T*PP is given by 
TAPP — 5(M + m)b*6" +mbc 6d + smc? 


1 21 j 
= (6 $) Gena is | (5): 


1 
zmbc zc 


c 


* The notation x’ means the transpose of the column vector X. The alternative notation x’ would cause 


confusion here. 
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Hence, V and T are the 2 x 2 matrices 


v= ae 0 y + 


0 smgc 


1 21 
3(M + m)b ey - 


ll 
a 


1 1 
xzmbc zc 


15.3. THE GENERAL THEORY OF NORMAL MODES 


In this section, we develop the general theory of normal modes for any oscillating 
system. This extends the method described in Chapter 5, which was restricted to two 
degrees of freedom. 


The small oscillation equations 

The first step is to obtain the general form of the small oscillation equations. This is 
done by substituting the approximate potential and kinetic energies V*PP and T*PP into 
Lagrange’s equations. Now 


OT @PP aV @PP 
. ik = 0, 
0qj oat 0qj qj 


=2 5 eau 


so that Lagrange’s equations become 


Small oscillation equations 


n 


Expanded form: > (tix Gk + VjK dk) =0 
= (15.11) 


(<j <n) 


Matrix form: T-q+V-q=0 


in the expanded and matrix forms respectively. These are the linearised equations for the 
small oscillations of S about the point q = 0. They are a set of n coupled second order 
linear ODEs satisfied by the unknown functions g1(f), g2(t), ..., Gn(t). 


Normal modes 


The next step is to find a special class of solutions of the small oscillation equations known 
as normal modes. We will show later that the general solution of the small oscillation 
equations can be expressed as a sum of normal modes. 


Definition 15.3 Normal mode A solution of the small oscillation equations that has 
the special form 
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Expanded form: qj = 4; cos(wt — y) 
(<j <n) (15.12) 
Matrix form: q = acos(wt — y) 


where the {aj}, w and y are constants, is called a normal mode of the system S. 


Notes. In a normal mode, the coordinates qi, 92,...,9n all vary harmonically in time 
with the same frequency w and the same phase y; however, they generally have different 
amplitudes a\,a2,..., Qn. The n-dimensional quantity a = (a1, d2,...,@n) is called the 
amplitude vector of the mode and, when considered to be a column vector, will be written 
a. Without losing generality, the angular frequency w can be assumed to be positive. 

On substituting the normal mode form (15.12) into the small oscillation equations 
(15.11), we obtain, on cancelling by the common factor cos(@t — y), 


Equations for the amplitude vector 


n 
Expanded form: > (vie = wrx) ar =0 


= (15.13) 


(dsjsn) 


Matrix form: (V =" T) -a=—0 


This is ann x n system simultaneous linear algebraic equations for the coordinate 
amplitudes {az}. A normal mode will exist if we can find constants {az}, @ so that the 
equations (15.13) are satisfied. Since the equations are homogeneous, they always have 


the trivial solution a, a2 on An 0, whatever the value of w. However, the 


trivial solution corresponds to the equilibrium solution q, = q2 = -+:: = qn = 0 of the 
governing equations (15.11), which is not a motion at all. We therefore need the equations 
(15.13) to have a non-trivial solution for the {a,}. There is a simple condition that this 
should be so, namely that the determinant of the system of equations should be zero, that 
is, 


Determinantal equation for w 
(15.14) 
det (V — wT) =0 


This is the equation satisfied by the angular frequency w in any normal mode of the system 
S. When expanded, this is a polynomial equation of degree n in the variable w”. If this 
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equation has any real positive roots wr, Ws, ..., then, for each of these values of w, the 
linear equations (15.13) will have a non-trivial solution for the amplitudes {az} and a 
normal mode will exist. 


Definition 15.4 Normal frequencies The angular frequencies w,, w2, ... of the nor- 
mal modes are called the normal frequencies of the system S. 


The normal frequencies are a very important characteristic of an oscillating system. 
They are found by solving the determinantal equation (15.14) for w. In the example that 
follows, we find the normal frequencies of the double pendulum, and three further worked 
examples are given in section 15.5. 


Example 15.3. Normal frequencies of the double pendulum 


Find the normal frequencies of the double pendulum for the case in which M = 3m 
andc = b. 


Solution 


With these special values, the matrices V and T become 


40 41 
_1 =1mp? 
v= jmeo (4 a T= 5mb € ae 


The determinantal equation for w is therefore 


det | }meo ( : — smb*a (; | = 0, 


which can be simplified into the form 


4n? —4@? —w? 


where n? = g/b. On expanding this determinant, we obtain 
30+ — 8n?a? + 4n4 = 0, 
which is a quadratic equation in the variable w~. This is the equation satisfied by the 


normal frequencies. This quadratic factorises and has two real positive roots wi, ws 
for w”, where 


ws = 2n?, 


where n* = g/b. The double pendulum therefore has the two normal frequencies 
(2g /3b)!/? and (2¢/b)!/?. 
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FIGURE 15.3 Normal modes of the double pendulum. Left: The slow mode. Right: The fast 
mode. (The angle €, which should be small, is made large for clarity.) 


Question Form of the normal modes 


What do the normal mode motions of the double pendulum look like? 


Answer 

To answer this we need to find the coordinate amplitudes in each of the normal modes. 
If the amplitudes of 6 and @ are a; and a2 respectively, then these amplitudes satisfy 
the linear equations 


An? —4a* —a* ay\ _ 
=o nm=w)\a) 


Slow mode: When w* = or = 2n? /3, the equations for the amplitudes aj, a2 
become, after simplification, 


Each of these equations is equivalent to the single equation 2a; = az so that we have 
the family of non-trivial solutions a; = €, a2 = 2€, where € can take any (non-zero) 
value. There is therefore just one slow normal mode. It has the form 


6 = €cos(./2/3nt — y), 
ob = 2€ cos(./2/3 nt — y), 


where the amplitude factor € and phase factor y can take any values.* This mode is 
shown in Figure 15.3 (left). 


(15.15) 


Fast mode: In the fast mode, we have w* = 2n? and, by following the same proce- 
dure, we find that there is also one fast normal mode. It has the form 


6= ecos(/2nt — y); 


@ = —2€ cos(V2nt — y), (15.16) 


where the amplitude factor € and phase factor y can take any values. This mode is 
shown in Figure 15.3 (right). 


* However, the linearised theory is a good approximation only when € is small. 
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15.4 EXISTENCE THEORY FOR NORMAL MODES 


So far we have not said anything general about the number of normal mode 
motions that a system posesses. This is related to the number of real positive roots of 
the equation 


det (V—1AT) =0. (15.17) 


When expanded, this is a polynomial equation of degree n in the variable 4, where 7 is 
the number of degrees of freedom of the system S. Such an equation always has n roots 
in the complex plane, but there seems to be no reason why any of them should be real, 
let alone positive. In fact, all of the roots of equation (15.17) are real and positive. This 
follows from generalised eigenvalue theory, which we will now develop. 


Definition 15.5 Generalised eigenvalues and eigenvectors LetK andLbe real nx 
n matrices. If there exists a number i and a (non-zero) column vector X such that 


K-x=AL-x, (15.18) 


then d is said to be a generalised eigenvalue of the matrix K (with respect to the matrix 
L) and x is a corresponding generalised eigenvector.* 


The defining equation (15.18) can also be written 
(K—AL)-x=0, (15.19) 
which has a non-zero solution for X only when 
det(K — AL) = 0. (15.20) 


This is the equation satisfied by the eigenvalues. Provided that L is a non-singular matrix, 
the eigenvalue equation is a polynomial equation of degree n in A, which has n roots in 
the complex plane. The more one knows about the matrices K and L, the more one can 
say about their eigenvalues and eigenvectors. 


Theorem 15.1 Eigenvalues of symmetric, positive definite matrices If K and L 
are real symmetric matrices and L is positive definite,’ then all the eigenvalues are real. 
If the matrix K is also positive definite, then all the eigenvalues are positive. 


Proof. Let x be any complex column vector and consider the scalar quantity X’ - K - x, where X is the 
complex conjugate of x. Then, since K is real and symmetric, 


-K-X=x'-K-X=(x'-K-X) =X’-K’-x 
K-x 


* Ordinary eigenvalues and eigenvectors correspond to the special case when L = 1, the identity matrix. 
+ A matrix A is called positive definite if its associated quadratic form x’ - A - X is positive definite. Since 
this condition is known to hold for V and T, both these matrices must be positive definite. 
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Hence X’ - K - x must be real, and, by a similar argument, x’ .L- x must also be real. 
Also, if we write X in terms of its real and imaginary parts in the form X = U+/ V, then 


xX’-L-x = (u—iv)’-L-(u+iv) 
=u’-L-u+v-L-v+i(u’-L-v—v’-L-u) 
=u’-L-u+v-L-v 

since X’ - L- x is known to be real. Since L is a positive definite matrix, it follows that u’ - L - u is positive 
except when u = 0, and v’ - L- v is positive except when v = 0. Hence X’ - L - x is positive except when 
x=0. 
Now suppose that X is a complex eigenvector corresponding to the complex eigenvalue 2. Then 
x’ -K-x=xX’-(K-x) =X’ -(AL- xX) 
=) (xX -L-x). 


But X’ - K-x is known to be real and xX’ -L- x is known to be real and positive (since the complex eigenvector 
X is not zero). Hence the eigenvalue 4 must be real. 

The eigenvalues are now known to be real, and we may therefore restrict the eigenvectors to be real 
too. Suppose that the real eigenvalue 2 has real eigenvector x and that the matrix K is now also positive 
definite. Then 


x’ -K-x=A) (x’-L-x). 


But, since K and L are both positive definite matrices, the quantities x’ - K- x and x’ - L- x are both positive. 
It follows that 4 must also be positive. Hi 


Since the matrices V and T are both symmetric and positive definite, the above the- 
orem applies to normal mode theory. It follows that the roots of the determinantal equa- 
tion (15.17) are all real and positive. If these roots are distinct (the most common case), 
then there are 7 distinct normal frequencies @), @2, ..., @y. It is however possible for 
the determinantal equation (15.14) to have repeated roots, so that there are fewer than n 
distinct normal frequencies. This usually happens when the system has symmetry; the 
spherical pendulum oscillating about the downward vertical is a simple example. 

The number of normal modes associated with a particular normal frequency, w (say), 
depends on whether wr is a simple or repeated root of the eigenvalue equation (15.17). 
It can be proved that, if Ort is a simple root, then the equations (15.13) for the amplitude 
vector a have a non-trivial solution that is unique to within a multiplied constant. There is 
therefore only one normal mode associated with the normal frequency w;. More generally, 
it can be proved that, if the root is repeated & times, then the equations (15.13) for the 
amplitude vector a have k linearly independent solutions.* The normal frequency then 
has k normal modes associated with it instead of one. It follows that, in all cases, we have 
the fundamental result that the total number of normal modes is always equal to n, the 
number of degrees of freedom of the system. 


* Tt follows from the orthogonality relations (see section 15.6) that the amplitude vectors of the normal 
modes must be linearly independent and therefore cannot exceed n in number. Hence, when there are n 
distinct normal frequencies, each frequency must have exactly one normal mode associated with it. The 
corresponding result in the degenerate case is not easy to prove and is beyond the scope of a mechanics 
text. (See Anton [7] and Lang [10].) 
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Suppose for example that the oscillating system has six degrees of freedom and that 
the determinantal equation (15.14) is 


(2? —07)* 40? —@*)? O50" — w*) = 0, 


after factorisation, where (2 is a positive constant. The normal frequencies are then @| = 
Q (double root), @2 = 2Q (triple root) and w3 = 5&2 (simple root). There are therefore two 
normal modes associated with the normal frequency 1, three normal modes associated 
with the normal frequency w2, and one normal mode associated with the normal frequency 
«3. The total number of normal modes is six, which is equal to the number of degrees of 
the system. 


Definition 15.6 Degenerate frequencies If a normal frequency has more than one 
normal mode associated with it, then that frequency is said to be degenerate. 


In the example above, the normal frequencies @; and w2 are degenerate, but 3 is not. 
The notion of degeneracy is important in quantum mechanics, where normal frequencies 
correspond to the energies of stationary states. An unperturbed atom may have an energy 
level E that is (say) five-fold degenerate. When the atom is perturbed (by a magnetic field, 
for example) the energies of five states may be changed by differing amounts so that the 
energy level is ‘split’ into five nearly equal levels. This is an important effect in the theory 
of atomic spectra. 


Existence of normal modes 


e For any oscillating system, the roots of the eigenvalue equation 
det (V—AT) =0 


are all real and positive and their values are the squares of the normal frequencies 
{w;} of the system. 
e if wr is a simple root of the eigenvalue equation, then the equations 


(V—w{T)-a=0 


for the amplitude vector a have a non-trivial solution that is unique to within a 
multiplied constant. There is therefore only one normal mode associated with the 
normal frequency @. 

e More generally, if the root Ort is repeated k times, then the equations for the 
amplitude vector a have k linearly independent solutions. The normal frequency 
@, is then degenerate with k normal modes associated with it. 

e In all cases, an oscillating system with n degrees of freedom has a total of n 
normal modes. 
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a a a a 


FIGURE 15.4 Transverse oscillations of three particles attached to a stretched string. 
(The particle displacments, which should be small, are shown large for clarity.) 


15.5 SOME TYPICAL NORMAL MODE PROBLEMS 


The determination of normal modes of vibration is an important subject with 
applications in physics, chemistry and mechanical engineering. The following three prob- 
lems are typical. The first involves the transverse oscillations of a loaded stretched string; 
such problems make popular examination questions! 


Example 15.4 Transverse oscillations of a loaded stretched string 


A light string is stretched to a tension Tg between two fixed points a distance 4a apart 
and particles of masses 3m, 4m and 3m are attached to the string at equal intervals, 
as shown in Figure 15.4. The system performs small plane oscillations in which the 
particles move transversely, that is, at right angles to the equilibrium line of the string. 
Find the frequencies and forms of the normal modes. 


Solution 
Although it is clear by symmetry that purely longitudinal modes exist, it is not obvious 
that purely transverse modes exist. This question is investigated in Problem 15.5, 
where it is shown that the longitudinal and transverse modes uncouple in the linear 
theory. In this setting, purely transverse modes do exist and can be found by setting 
the longitudinal displacements equal to zero, which is what we will do here. 

Let the transverse displacements of the three particles be y;, y2, y3, as shown. 
Then the extension A, of the first section of string is given by 


where the neglected terms have power four (or higher) in the small quantity y;. Con- 
sider now the potential energy V of this section of string. If the string had no initial 
tension then V; would be given by V; = 5a}, where «@ is the ‘spring constant’ of 
the first section of the string. However, since there is an initial tension To, the formula 
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is modified to 


vi y? ° 
Vi = ToA, + xa AT = Tp eae + Sa ae 


ms Toy; 
2a 


fees, 


where the neglected terms have power four (or higher) in the small quantity y;. Note 
that, in the quadratic approximation, the spring constant a does not appear so that the 
increase in tension of the string is negligible. 

In the same way, the potential energies of the other three sections of string are 
given by 


_ Tor —y\)? aay, —_ To(y3 — y2)" ees Vi Toy3 


V. ; , rig 
2a 2a . 2a 


and the total approximate potential energy is given by 


app _ Toy; es To(y2 — yi)? a To(y3 — ya)" 5 Toy3 


V 
2a 2a 2a 2a 
To 
=5, (2x7 + 2y3 + 2y3 — 2y1y2 - 2y2ys) 


The V -matrix is therefore 


Dat % 
T 

V=5 1 
(reat 2 


In this problem, the exact and approximate kinetic energies are the same, namely 
T = T*? = 5(3m) yi + 5 (4m) 53 + 53m)53, 
so that the T -matrix is 
300 
T= sm 040 
003 
The eigenvalue equation det(V — AT) = 0 can therefore be written 
2-—3u -1 0 
—-1 2-4p -1 |=0, 
0 —1 2-—3py 
where ps = maw*/Tp. When expanded, this is the cubic equation 


(Se = 330" +17 2—2=0 
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for the parameter jz. Such an equation would have to be solved numerically in general, 
but problems that appear in textbooks (and in examinations!) are usually contrived so 
that an exact factorisation is possible; this is true in the present problem. Sometimes 
a factor can be spotted while the cubic is still in determinant form. In the present 
case, one can see that, by subtracting the third row of the determinant from the first, 
the cubic has the factor 2 — 3. If no factor can be spotted in this way, then one must 
try to spot that the expanded cubic equation has a (hopefully small) integer root. In 
the present case, one would have to spot that 4. = | is a root of the expanded cubic. 
By using either method, our cubic equation factorises into 


(64 —)Gu-2u—-)=0, 


and its roots are 4; = 1/6, w2 = 2/3, w3 = 1. Since u = maw*/To, the normal 
frequencies are given by 


To 
2 2 
=, @ = =, oO, = —. 

2 oe 
Since the normal frequencies are non-degenerate, the corresponding amplitude vec- 
tors are unique to within multiplied constants. In the slow mode, jz = 1/6 and the 
equations (V — AT) - a = 0 for the amplitude vector a become 


9-6 0 a} 0 
—6 8-6]-;a]=]0], 
0-6 9 a3 0 


on clearing fractions. It is evident that aj = 2, a2 = 3, a3 = 2 is a solution so that the 
amplitude vector for the mode with frequency @ is aj = (2, 3, 2). The other modes 
are treated in a similar way and the amplitude vectors are given by 


2 1 1 
aj=/3]., a= 0], az=]—-l 
2 —l 1 


These are the forms of the three normal modes. (It is a good idea to sketch the shapes 
of the three modes.) 


Our second example is concerned with the internal vibrations of molecules, an 


important subject in physical chemistry. Although such problems should really be treated 


using quantum mechanics, the classical theory of normal modes gives much valuable 


information with far less effort. It would also be very difficult to understand the quantum 


treatment of molecular vibrations without first having studied the classical theory. The 


simplest case in which there is more than one frequency is the linear triatomic molecule. 


In this case the three atoms lie in a straight line and can perform rectilinear oscillations. 


The classic example of a linear triatomic molecule is carbon dioxide. 
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M m M 


FIGURE 15.5 A simple classical model of a linear 
symmetric molecule. 


Example 15.5 The linear triatomic molecule 


A symmetric linear triatomic molecule is modelled by three particles connected by 
two springs, arranged as shown in Figure 15.5. Find the frequencies of rectilinear 
vibration of the molecule and the forms of the normal modes. 


Solution 

Let the centre atom have mass m, the outer atoms have mass M, and the springs have 
constant a. In this context, the spring constant is a measure of the ‘strength’ of the 
chemical bond between the two atoms. (We will suppose that the interaction between 
the two outer atoms is negligible.) Let the displacements of the three atoms from 
their equilibrium positions be x1, x2, x3 as shown in Figure 15.5. Then the kinetic 
and potential energies of the molecule are given by 


1 «2 Lee 1 «2 
T= T *PP = 3Mx; + ZNXy + 53MXx3, 


Yar = 5au(x2 =x)? + 50(x3 — x2)* 


= sou (32 + 2x4 + Ke — 2x,x2 — 2x2x3.) 


The T- and V- matrices are therefore 


M0 0 1-1 0 
T=}{0mo0], V=5a]-1 2-1 
00M 0-1 1 


Although everything looks normal, this problem has a non-standard feature in that the 
potential energy of the molecule does not have a true minimum* at x, = x2 = x3 = 0. 
This is actually clear from the start since the potential energy is unchanged if the 
whole molecule is translated to the right or left. Strictly speaking then, our theory 
does not apply to this problem, but, fortunately, only minor modifications are needed. 
In cases like this, it turns out that one or more of the normal frequencies is zero. 
These zero frequencies do not correspond to true normal modes. They correspond 
to uniform translational (or rotational) motions of the molecule as a whole. In the 
present case, the only uniform motion allowed is translational motion along the line 
of the molecule, so that we expect just one of the normal frequencies to be zero. 


* As a result, the V-matrix is not positive definite. 
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The eigenvalue equation det(V — AT) = 0 can be written 


l-p —1 0 
= Qey "gp =1 |=, 
0 —1 l-w 


where 4 = Maw?/a and y = M/m. The roots of this cubic equation are easily found 
to be uw = 0, w = 1 and uw = 1+2y. The zero root corresponds to uniform translation 
and the other two are genuine oscillatory modes with vibrational frequencies given 


by 


> (U+2y)a 
. @) = MM 


The amplitude vectors corresponding to the vibrational modes are 


—1 1 
a, = OO}, a2=| —2y |, 
1 1 


respectively. We see that, in the slow w,-mode, the centre particle remains at rest 
while the outer pair oscillate symmetrically about the centre. This is called the sym- 
metric stretching mode of the molecule. The fast w2-mode, in which all three par- 
ticles move with the outer atoms remaining a constant distance apart, is called the 
antisymmetric stretching mode. 


Comparison with experiment 

Since the value of the constant @ is unspecified, it can be chosen to fit any observed 
frequency. However, the frequency ratio w2/@ (= (+2y)¥ >) is independent of a 
and therefore affords a check on the theory. 

The vibrational frequencies of real molecules can be measured with great accu- 
racy by infrared and Raman spectroscopy. Spectroscopists measure the wavelength A 
of radiation that excites each vibrational mode. The mode frequency is proportional to 
the reciprocal wavelength A~!, the standard units being cm~!. Table 2 compares the 
observed and theoretical results for carbon dioxide and carbon disulphide,* both of 
which have linear symmetric molecules. 

The theoretical values of the frequency ratio w2/@ are within about 8% of those 
measured experimentally, which, considering the simplicity of the theory, is very good 
agreement. 

Some more examples on vibrating molecules are given in the problems at the 
end of the chapter. The standard reference on this subject is the monumental work of 
Herzberg [13], Volume II. m 


Our third example involves a rigid body suspended by three strings. Problems 


of this type tend to be difficult because the constraints make it difficult to calculate the 


* The atomic weights of carbon, oxygen and sulphur are C = 12,0 = 16, S = 32. 


15.5 Some typical normal mode problems 


Molecule ay) em!) ie en?) eee (1+2y)!/2 
O=€-6 1337 2349 1.76 1.91 
S-C=5 657 1532 253 2.52 


Table 2 Vibrational frequencies of linear triatomic molecules; 
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FIGURE 15.6 A flat plate of general shape is suspended by three equal strings. 


potential energy. It is all the more surprising then that the following problem has a simple 
exact solution. 


Example 15.6 Plate supported by three strings 


A flat plate of general shape and general mass distribution is suspended in a horizontal 
position by three vertical strings of equal length. Find the normal frequencies of small 
oscillation. 


Solution 

The system is shown in its equilibrium position in Figure 15.6, top left. The three 
strings are of length @ and are attached to the points A, B, C of the plate. The point 
K (see Figure 15.6, top right) is the centre of the circle that passes through A, B,C. 
Our initial choice of generalised coordinates is shown in Figure 15.6, bottom left. 
Xx, Yx are the horizontal Cartesian* displacement components of the point K from 


* These can be any Cartesian axes Koxyz with Kgz pointing vertically upwards; {i, j, k} are the corre- 
sponding unit vectors. 
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its equilibrium position Ko, and 6@ is the rotation angle of the plate about the vertical 
axis through Ko.* Three coordinates are sufficient since the three string constraints 
reduce the number of degrees of freedom of the plate from six to three. 

We will now calculate the potential energy of the plate in terms of the coordi- 
nates Xx, Yx and @. This is the tricky step since the plate does not remain horizontal 
and this complicates the geometry. However, the vertical displacement of any point of 
the plate is guadratic in the small quantities Xx, Yx, 6 and, providing care is taken, 
this enables us to use approximations. Consider first Aa, the displacement of the 
point A. This is given approximately by 


Aa=Xxi+Yxj+(0k)xa+---, 


correct to the first order in small quantities, where a is the position vector of A relative 
to Kg in the equilibrium position. As expected, this displacement is horizontal, correct 
to the first order in small quantities. The square of the magnitude of this horizontal 
displacement is therefore 


(XeitYej+0kxa) 


= XK +VE +6" |al? + 2X KO (i+ (kx a)) + 2V KO (j + (kxa)) 
=X24Y2 + R70? + 2¥K0(a- i) —2XxK0(a: j), 


correct to the second order in small quantities, where R (= |a]|) is the radius the circle 
passing through A, B and C. Since A is one of the points that is suspended by a string 
of length £, an application of Pythagoras shows that the vertical displacement of the 
point A is given by 


Xi +Y2 + R767 2YK0 . 2XKO ; 
u=( y) +( y) )a-n-( oY] Jaa. 


correct to the second order in small quantities. Similar expressions exist for zg and 
zc with a replaced by b and ¢ respectively. Now for the clever bit. Since the plate is 
flat, it follows that a general point of the plate with position vector r relative to Ko in 
the equilibrium position must have vertical displacement* 


2 2 292 
_ (*# +YZ + R70 )+ (2x*) a (x") rf) (15.21) 


2€ 2€ 2£ 


in the displaced position, correct to the second order in small quantities. This expres- 
sion confirms that the plate is not generally horizontal in the displaced position. 


* As we will see, the plate does not remain horizontal and so the angle 6 ought to be defined more carefully. 
Let K P be any line fixed in the plate and let the projection of this line on to the equilibrium plane of the 
plate be K’P’. The angle 6 can be properly defined as the angle turned through by the line K’P’. This is 
not quite the same as the angle turned through by the projection of some other line lying in the plate, but 
the differences are quadratic in the small quantities X x, Yx and @ and will turn out to be immaterial. 

+ This is because the expression (15.21) is linear in r and gives the correct z-values at A, B, C. 
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The purpose of all this is to calculate the potential energy V = Mgzc, where M 
is the mass of the plate and zg is the vertical displacement of the centre of mass G. 


With no loss of generality we may take the axis Ko. to point in the direction KgGo so 
that rg = Di, where D is the distance K G. The general formula (15.21) then shows 


that 
XZ + Ye + R70? 4 (2X9) p 
2¢ 26 
Xx + Vx + D0) + (R* — D*)0? 
2¢ 


correct to the second order in small quantities. Hence the approximate potential 
energy is given by 


M 
yeep — + [xz +(x + D6)? + (R?- b)6?| 


This formula can be simplified further by a change of generalised coordinates. 
Let XG, Yg be the horizontal Cartesian displacement components of the centre of 
mass G from its equilibrium position Go, and 0g be the rotation angle of the plate 
about the vertical axis through Go. Then, correct to the first order in small quantities, 


Xg=Xkx, Yo =Y¥x+D90, 6g = 86. 


In terms of the generalised coordinates XG, Yg, @ (we will drop the subscript 
from 6g from now on) the expression for V #PP is 


M 
yar — ~ (x2 +72 + (R?- b*) 6?) 


and the V -matrix is 


10 0 
M 
v=> 01 0 
0 0R2— D2 


a remarkably simple result in the end. 
The approximate kinetic energy is calculated when the plate is passing through 
its equilibrium position. This is simply 


T*PP = SMXG + 5MYG +410", 


where / is the moment of inertia of the plate about the axis through G perpendicular 
to its plane. If we write 7 = Mk?, then the T-matrix becomes 


0 
0 


10 
T=3M{01 
00k 
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The normal frequencies are therefore given by 


2 2 
wr = ; (doubly degenerate), ws = (=*) A 
which correspond to two translational modes and one rotational mode. 
In particular, if the lamina is a uniform circular ring of radius a, then R = a, 
D = Oandk =a. Then w2 = a and the system has the single triply degenerate 
normal frequency (g/2)!/*. In this case, any small motion of the system is periodic 
with period 27(€/g)'/?. m 


15.6 ORTHOGONALITY OF NORMAL MODES 


In this section we will show that the n amplitude vectors of the normal modes 
of an oscillating system are mutually orthogonal, in a sense that we will make clear. This 
is an important theoretical result, but it is not needed in the practical solution of normal 
mode problems. We will make use of orthogonality in our treatment of normal coordinates 
in section 15.8. 

The basic theorem on orthogonality of eigenvectors is as follows: 


Theorem 15.2 Orthogonality of eigenvectors Suppose K and Lare symmetric n xn 
matrices and that X, and X2 are generalised eigenvectors of K (with respect to the matrix 
L) belonging to distinct eigenvalues. Then X, and Xz are mutually orthogonal (with 
respect to the matrix L) in the sense that they satisfy the relation 


x, -L- x. =0. 
Proof. Suppose that 41 and Ay are distinct eigenvalues with the corresponding eigenvectors X; and X 
respectively. Consider the scalar quantity 
Xi > K-X2 =X) - (K+ Xp) =X} + (AgL - Xp) 
=2 (x) -L-Xp9). 
However, the same quantity can also be written 
X) - K+ Xp = (K+ x1)! © Xo = (K+ xy)! + Xp = (APL 1)! © Xp = Aq (KY L’ + Xp) 

= Aj (x, +L x), 

since K and L are both symmetric. It follows that 
Aq = A2) (x) -L. X2) = 0, 

and, since A; # A2, that 


x, -L-x. =0. 


Since the matrices V and T of an oscillating system are both real and symmetric, 
the above theorem applies to normal mode theory. It follows that if a; and az are the 
amplitude vectors of two normal modes of an oscillating system with distinct frequencies, 
then 


a -T- a. =0. 
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This result is not necessarily true for amplitude vectors that belong to the same (degener- 
ate) frequency, but they can always be chosen to do so. If this has been done, then the full 
set of amplitude vectors a,, a2, ...a@, are mutually orthogonal. 


Orthogonality of normal modes 


The amplitude vectors a1, a2, ...a@, of the normal modes of an oscillating system 
satisfy (or can be chosen to satisfy) the orthogonality relations 


a’. -T-a, =0 (j #k). (15.22) 


For theoretical purposes, it is also convenient to normalise the amplitude vectors. 
Since T is a positive definite matrix, the quantities a -T - a), a@,-T-+@,...,a,+T-a, 
are all positive. It follows that the amplitude vectors can be scaled so that 


a, -T-aj=a,-T-a=---=a),-T-a, =1, (15.23) 


in which case they are said to be normalised. The orthogonality and normalisation rela- 
tions (15.23), (15.23) can then be combined into the single set of relations 


0 GFK) 


15.24 
1 G=h ra 


a)-T-a =| 


called the orthonormality relations. 


Rayleigh’s minimum principle 
As an application of the orthogonality relations, we will now prove a far reaching result 


known as Rayleigh’s minimum principle. Suppose an oscillating system S with n degrees 
of freedom has potential and kinetic energy matrices V and T. Consider the function 


x’-V-x 


eae oe 


(15.25) 


where X is any non-zero column vector of dimension n. The function F(X) is called 
Rayleigh’s function for the system S and it has some interesting properties. To keep 
things simple we will suppose that S has no degenerate normal frequencies. 


Theorem 15.3 Rayleigh’s minimum principle Suppose that an oscillating system S 
has Rayleigh function F (x). Then 


F(x) > of (15.26) 
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for all non-zero column vectors X, where @, is the fundamental frequency* of S. The 
minimum value is achieved when X is a multiple of the amplitude vector the fundamental 
mode. 


Proof. Let the n normal frequencies be ordered so that mw; < w2 < --- < @, and let the 
corresponding amplitude vectors be a1, a2, ..., @). We will suppose that the amplitude vectors 
have been normalised so that they satisfy the orthonormality relations (15.24). 

Now let x be any column vector. Since the n amplitude vectors form a basis set," X can be 
expanded in the form X = aa, + a2@ +---+a,a,. Then 


xX’ -V-x=x’-V- (aya; + aa +---+a7a,) 
= a (x’-V-aj) +02 (x’-V- ay) +--+» +a, (x’-V- a) 
= ajo; (x + T+ ay) +0205 (x’+ T+ a2) +++» tana, (x + Tan). 


But 


x! + Ts ay = (aa) + aA +--+ + Onn) Ts aK 
= a (1+ T+ ax) +2 (2+ T+ ag) + +++ + On (An + T+ aK) 
= Ak 
on using the orthonormality relations. Hence 


DD Di 2 yee 
xX -Vex=ajoi taza; t+-:- +a, 0; 


and, by a similar argument, 


2 


n° 


xX -T-x=a%+05+---+0 
Hence 


Deen, 222), 2 02 DP ods 2.2) ies 
Oy Or May a On OO Oe 


FX) = Ee 
at taz+---+a? at taz;+---+a? 


Sasa 

=O} 
which is the required result. It is also evident that equality can only occur when a2 = a3 = 
++. = Q, = 0, that is, when X = aa). 


This result means that F (x) is an upper bound for wr, for any choice of the column 
vector X; this upper bound has been obtained without solving the oscillation problem. 
Moreover, if we could substitute every value of x into the function F (Xx), then the vectors 
that yield the least value of F must be multiples of the amplitude vector a). 


* The fundamental frequency is the lowest of the normal frequencies and the corresponding normal mode 
is the fundamental mode. 

* This follows because the column vectors a, @, ..., Ay Satisfy the orthogonality relations (15.22). A set 
of mutually orthogonal vectors must be linearly independent, and, since there are n of them, they form a 
basis for the space of column vectors of dimension n. 
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In normal mode theory, this result is of little consequence since the normal frequencies 
are simply the roots of a polynomial equation which can always be solved numerically. 
However, Rayleigh’s principle has extensions to many areas of applied mathematics and 
physics such as continuum mechanics and quantum mechanics. In these subjects, the 
oscillation problems often cannot be solved, even numerically, and Rayleigh’s principle is 
one of the few ways in which information can be gained about the fundamental mode. For 
example, in quantum mechanics Rayleigh’s minimum principle takes the form: 


Suppose S is a quantum mechanical system with Hamiltonian H and ground state 


energy E,. Then 


Ox|H}x) 


(x |x) m 


for any choice of the quantum state X. 


15.7 GENERAL SMALL OSCILLATIONS 


Normal modes are special small motions but, from them, we can generate the 
general solution of the small oscillation equations. The result is as follows: 


General solution of small oscillation equations 


The general solution of the small oscillation equations can be expressed as a linear 
combination of normal modes in the form 


q(t) = Cya, cos(@jt — y,) + C2a2 cos(@2t — y2) +--+ + Cn @n COS(@nt — Yn) 


where the amplitude factors {C ;} and phase factors {y;} are arbitrary constants. 


Proof. Suppose that q(t) is any solution of the small oscillation equations (15.11). We will now show 
that we can construct a linear combination of normal modes that satisfies the small oscillation equations 
and also satisfies the same initial conditions as q(t). To do this, take a general linear combination of normal 
modes in the form 


q*(t) = Cj a] cos(@jt — yy) + Cz2a2 cos(mgt — y2) +--+ + Cran COS(@nt — Yn) 


a, (Aj cos@,t + By sinw t) + az (A2 cos wt + Bz sinwt) + 
»++ An (An COS Ont + Bn Sina@nt) , 


on writing Cj cos(wjt — yj) = Aj cosw;jt + Bj sinw;t. Since the small oscillation equations are linear 
and hhainagsteout q*(t)isa <olatoi for all Gee of the coefficients {A ;}, {Bj}. We now need to choose 
the coefficients {A ;}, {B;} so that q *(0) = q(0) and q*(0) = q(0). This requires that the coefficients {Aj} 
be chosen so that 


Aja, + Agag + +--+ Anan = q(0), 
and that the coefficients {B ;} be chosen so that 


(By @ 1) ay + (B2@7) az +--+ (Bron) an = q(0). 
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This is always possible because the n amplitude vectors a,, a2, ..., @n form a basis for the space of vectors 
of dimension n. The vectors g(t) and q(t) can therefore be expanded in the required forms. 

We have thus constructed a solution g*(t) of the small oscillation equations that satisfies the same 
initial conditions as the solution g(t). But ODE theory tells us that there can be only one such solution and 
so q = q*. Since q can be any solution and q* is a linear combination of normal modes, it follows that any 
solution of the small oscillation equations can be expressed as a linear combination of normal modes. @ 


Example 15.7 General small motion of the double pendulum 


Find the general solution of the small oscillation equations for the double pendulum 
problem. 


Solution 
The normal modes for the double pendulum problem have been found to be 


6 = €, cos(./2/3nt — 1) 6= € cos(/2 nt — y2) 
ob = 2€1 cos(./2/3 nt — y1) |’ & = —2€2 cos(V/2nt — yr) J 


The general small motion is therefore 


6= €,cos(./2/3nt —y1) +62 cos(/2 nt — y2), 
b = 2€; cos(./2/3 nt — vy) — 2€2 cos(/2 nt — 2), 


where €1, €2, /|, 2 are arbitrary constants. ll 


General small motion not usually periodic 


The general small motion is a sum of periodic motions, but it is not usually periodic itself. 
Periodicity will occur only if there is some time interval t that is an integer multiple of 
each of the periods T1, T2, ..., T, of the normal modes. This only happens when the ratios 
of the normal mode periods are all rational numbers. In the double pendulum example, 
T/T2 = @2/a, = J/3, which is irrational. The general small motion is therefore not 
periodic. 


15.8 NORMAL COORDINATES 


The preceding theory applies for any choice of the generalised coordinates {q ;}. 
Changing the generalised coordinates will change the V - and T -matrices, but the normal 
frequencies and the physical forms of the normal modes will be the same. This suggests 
that it might be possible to make a clever choice of coordinates so that the V- and T- 
matrices have a simple form leading to a much simplified theory. In particular, it would 
be very advantageous if T and V had diagonal form. 


Definition 15.7 Normal coordinates A set of generalised coordinates in terms of 
which the T - and V -matrices have diagonal form are called normal coordinates. 


Actually, every oscillating system has normal coordinates, as we will now show. Let 
q be the original choice of coordinates with corresponding matrices V and T. Then 


TP — q-T-q, ver — q’-V-q. (15.27) 
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Now consider a change of coordinates from q to 9 defined by the linear transformation* 
q=P-n <— n=P!-q (15.28) 


where P can be any non-singular matrix. On substituting the transformation (15.28) into 
the expressions (15.27), we obtain 


rere — (P- 9)’ T-(P-4) =H: (P’-T-P)- a, 
vePP — (P-n)’-V-(P-n) =: (P’'-V-P)-n, 


from which we see that this transformation of coordinates causes V and T to be trans- 
formed as 


T> P’.T-P, V—P’.V-P. (15.29) 


Can we now choose the transformation matrix P so that the new T- and V-matrices are 


diagonal? 
Let a, a2, ..., @, be the amplitude vectors of the normal modes when they are 
expressed in terms of the coordinates g and let @1, @2, ..., @, be the corresponding 


normal frequencies. We will suppose that these amplitude vectors have been chosen so 
that they satisfy the orthonormality relations (15.24), that is 


0 GFK), 
e T e = 1 . 
aj a {‘ Gad. (15.30) 
Now consider the matrix P whose columns are the amplitude vectors {a ;}, that is, 
P = (aj|a|---| an). (15.31) 


Since the amplitude vectors are known to be linearly independent, P has linearly inde- 
pendent columns and is therefore a non-singular matrix. Let us now try this P as the 
transformation matrix. Then 


PT P= |— | 47+ @ila|Jap: 


* For example, in the case of two degrees of freedom, this transformation has the form 


4. = Pum + P22, 
q2 = p21 + P2272. 
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The jxk-th element of this matrix is given by 


z 


{0 G#h 
aj Tea ={{ (j=h) 


by the orthonormality relations. Hence, with this choice of P, 
P’.T-P=1, 
where 1 is the identity matrix. In the same way, 


/ 

Gan 
7 

rd 
3 -V- (aj|a|---|a,). 


a’ 


n 


The jk-th element of this matrix is given by 


al, Ve ay =al,- (Vea) =a) - (of - ax) = of (a, -T- ax) 


_|0 GF), 
~ wo; (=k). 


Hence, with this choice of P, 
P’.V.P=Q’, 


where Q is the diagonal matrix whose diagonal elements are the normal frequencies, that 
iS, 


ae Uo 0 
O ws 0 
Q=].. 
0. Oss aa 


We have thus succeeded in reducing both V and T to diagonal form. Hence the coordinates 
{n;} defined by (15.28) with P = (aj| @p|---|@,) are a set of normal coordinates. They 
are given explicitly by 


on using the formula P’ - T - P = 1. This can also be written in the semi-expanded form 


nj = (a, -T) q (l<j<n). (15.32) 
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From this last formula, we can see that, if the amplitude vectors {a;} are not normalised, 
then the coordinates {7;} are simply multiplied by constants. They are therefore still 
normal coordinates. The corresponding V - and T-matrices are still diagonal, but T is no 
longer reduced to the identity. 

Our results are summarised as follows: 


Finding normal coordinates 


Let a), a2, ..., a, be the amplitude vectors of the normal modes when expressed in 
terms of the coordinates {g;}. Then the coordinates {7 ;} defined by 


nj = (ai;-T) -q (l<j <n) 


are a set of normal coordinates, as are any constant multiples of them. (The ampli- 
tude vectors only need to be normalised if it is required to reduce the matrix T to the 
identity.) 


When expressed in terms of normal coordinates, the small oscillation equations 
become 


7+’ -n=0. 
In expanded form, this is 
ij toinj=0 (l<j<n), 


a system of n uncoupled SHM equations. The solution nj = C,cos(@\t — y1), 2 = 
n3 = ++: = Ny, = O is the first normal mode, the solution 72 = C2cos(w2t — y2), 


nN, = 3 =-++: = Ny = Ois the second normal mode, and so on. 


Note. Using normal coordinates is not a practical way of solving normal mode problems. 
Indeed the problem has to be solved before the normal coordinates can be found! Normal 
coordinates are important because they simplify further developments of the general theory. 


Example 15.8 Finding normal coordinates 


Find a set of normal coordinates for the double pendulum problem. 


Solution 
For the double pendulum problem, we have already found that 


5(41 1 i 
rowe(f) e(). me(2) 
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Hence, on dropping the inessential constant factor smb", a set of normal coordinates 


is given by 
41 0 
m=(1 2+ (4 1) (4) =9 +30 


m= -2)-(1 i): (4) =29-¢ 


Since normal coordinates may always be scaled, we can equally well take 


and 


ni = 20+, 
n2 = 20 — @, 


as our normal coordinates. 


Problems on Chapter 15 


Answers and comments are at the end of the book. 


Harder problems carry a star (*). 


Two degrees of freedom 


15.1 A particle P of mass 3m is connected to a particle Q of mass 8m by a light elastic spring 
of natural length a and strength w. Two similar springs are used to connect P and Q to the 
fixed points A and B respectively, which are a distance 3a apart on a smooth horizontal table. 
The particles can perform longitudinal oscillations along the straight line AB. Find the normal 
frequencies and the forms of the normal modes. 

The system is in equlilibrium when the particle P receives a blow that gives it a speed u 


a 
in the direction AB. Find the displacement of each particle at time f in the subsequent motion. 


15.2 A particle A of mass 3m is suspended from a fixed point O by a spring of strength w 
and a second particle B of mass 2m is suspended from A by a second identical spring. The 
system performs small oscillations in the vertical straight line through O. Find the normal 
frequencies, the forms of the normal modes, and a set of normal coordinates. 


15.3 Rod pendulum A uniform rod of length 2a is suspended from a fixed point O by a light 
inextensible string of length b attached to one of its ends. The system moves in a vertical plane 
through O. Take as coordinates the angles 0, @ between the string and the rod respectively 
and the downward vertical. Show that the equations governing small oscillations of the system 
about 06 = ¢ = O are 


bé + ab = —g9, 


b6 + ab = —g¢. 
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For the special case in which b = 4a/5, find the normal frequencies and the forms of the 
normal modes. Is the general motion periodic? 


Three or more degrees of freedom 


15.4 Triple pendulum A triple pendulum has three strings of equal length a and the three 
particles (starting from the top) have masses 6m, 2m, m respectively. The pendulum performs 
small oscillations in a vertical plane. Show that the normal frequencies satisfy the equation 


12? — 60u? + 81 — 27 =0, 


where 4. = aw*/g. Find the normal frequencies, the forms of the normal modes, and a set of 
normal coordinates. [j2 = 3 is a root of the equation. ] 


15.5 A light elastic string is stretched to tension 7p between two fixed points A and B a 
distance 3a apart, and two particles of mass m are attached to the string at equally spaced 
intervals. The strength of each of the three sections of the string is wa. The system performs 
small oscillations in a plane through AB. Without making any prior assumptions, prove that 
the particles oscillate longitudinally in two of the normal modes and transversely in the other 
two. Find the four normal frequencies. 


15.6 Arod of mass M and length L is suspended from two fixed points at the same horizontal 
level and a distance L apart by two equal strings of length b attached to its ends. From each 
end of the rod a particle of mass m is suspended by a string of length a. The system of the rod 
and two particles performs small oscillations in a vertical plane. Find V and T for this system. 
For the special case in which b = 3a/2 and M = 6m/5, find the normal frequencies. Show 
that the general small motion is periodic and find the period. 


15.7 A uniform rod is suspended in a horizontal position by unequal vertical strings of lengths 
b, c attached to its ends. Show that the frequency of the in-plane swinging mode is ((b + 
c)g/2bc)'/*, and that the frequencies of the other modes satisfy the equation 


beu? — 2a(b +c) + 3a” = 0, 


where 4 = aw*/g. Find the normal frequencies for the particular case in which b = 3a and 
c= 8a. 


15.8% A uniform rod BC has mass M and length 2a. The end B of the rod is connected to a 
fixed point A on a smooth horizontal table by an elastic string of strength a, and the end C is 
connected to a second fixed point D on the table by a second elastic string of strength a2. In 
equilibrium, the rod lies along the line AD with the strings having tension To and lengths b, c 
respectively. Show that the frequency of the longitudinal mode is ((a + a@2)/M)!/* and that 
the frequencies of the transverse modes satisfy the equation 


b?c? u* — 2bc(2ab + 3bc + 2ac)u + 6babc(2a +b +c) = 0, 


where ps = Maw?/Tpo. [The calculation of V*?P is very tricky. ] 
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Find the frequencies of the transverse modes for the particular case in which a = 3c and 
b=Se. 


15.9% A light elastic string is stretched between two fixed points A and B a distance (n + 1)a 
apart, and n particles of mass m are attached to the string at equally spaced intervals. The 
strength of each of the n+1 sections of the string is ~7. The system performs small longitudinal 
oscillations along the line AB. Show that the normal frequencies satisfy the determinantal 
equation 


2cos@ —-l1 O.:-- O 0 
—1 2cosO-1--. O 0 
A=| oi: ic, ff [=0, 
0 0 0 --- 2cos@ —1 
0 0 0 --- —1 2cosé 


where cos @ = 1 — (mw?/2a). 
By expanding the determinant by the top row, show that A,, satisfies the recurrence rela- 
tion 


An = 2cosOAy_1 — An_2, 
for n > 3. Hence, show by induction that 
An = sin(n + 1)0/siné@. 
Deduce the normal frequencies of the system. 


15.10 A light string is stretched to a tension Tp between two fixed points A and B a distance 
(n + 1)a apart, and n particles of mass m are attached to the string at equally spaced intervals. 
The system performs small plane transverse oscillations. Show that the normal frequencies 
satisfy the same determinantal equation as in the previous question, except that now cos@ = 
1 — (maw* /2To). Find the normal frequencies of the system. 


Vibrating molecules 


15.11 Unsymmetrical linear molecule A general linear triatomic molecule has atoms A, 
A2, A3 with masses m1, m2, m3. The chemical bond between A, and A2 is represented by a 
spring of strength a 12 and the bond between Ap? and A3 is represented by a spring of strength 
a23. Show that the vibrational frequences of the molecule satisfy the equation 


4 2 
mym2m3 @° — [ay2m3(m, + m2) + @23m1 (m2 + m3)) @ 


+ &12%93(m 1 + m2 + m3) = 0. 


Find the vibrational frequencies for the special case in which m, = 3m, m2 = m, m3 = 2m 
and a12 = 3a, a23 = 2a. 
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FIGURE 15.7 Vibrations of a symmetric V-shaped molecule. Left: a symmetric 
motion, Right: an antisymmetric motion. 


The molecule O—C-—S (carbon oxysulphide) is known to be linear. Use the 1 ie val- 
ues given in Table 2 to estimate the ratio of its vibrational frequencies. [The experimentally 
measured value is 2.49.] 


15.12% Symmetric V-shaped molecule Figure 15.7 shows the symmetric V-shaped triatomic 
molecule X Y2; the X—Y bonds are represented by springs of strength k, while the Y—Y 
bond is represented by a spring of strength « k. Common examples of such molecules include 
water, hydrogen sulphide, sulphur dioxide and nitrogen dioxide; the apex angle 2a is typically 
between 90° and 120°. In planar motion, the molecule has six degrees of freedom of which 
three are rigid body motions; there are therefore three vibrational modes. It is best to exploit the 
reflective symmetry of the molecule and solve separately for the symmetric and antisymmetric 
modes. Figure 15.7 (left) shows a symmetric motion while (right) shows an antisymmetric 
motion; the displacements X, Y, x, y are measured from the equilibrium position. Show that 
there is one antisymmetric mode whose frequency w3 is given by 


k 
w3 = rs + 2m sin? a), 


and show that the frequencies of the symmetric modes satisfy the equation 
w= (1 +2y cos” a + 2€) p+ 2€ cos? a(1 + 2y) = 0, 
where fs = mw?/k and y = m/M. 
Find the three vibrational frequencies for the special case in which M = 2m, a = 60° and 


e=1/2. 


15.13 Plane triangular molecule The molecule BCl; (boron trichloride) is plane and sym- 
metrical. In equlibrium, the Cl atoms are at the vertices of an equilateral triangle with the 
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B atom at the centroid. Show that the molecule has six vibrational modes of which five are 
in the plane of the molecule; show also that the out-of-plane mode and one of the in-plane 
modes have axial symmetry; and show finally that the remaining four in-plane modes are in 
doubly degenerate pairs. Deduce that the BCl3 molecule has a total of four distinct vibrational 
frequencies. 


Computer assisted problem 


15.14 Sulphur dioxide molecule Use computer assistance to obtain an equation satisfied by 
the squares of the frequencies of the symmetric modes of a V-shaped molecule. For the special 
case in which M = 2m and a = 60°, show that the frequencies of the symmetrical modes 
satisfy the equation 


4u> — (5+ 8e)u+4e = 0, 


where 2 = ma@?/k. 

The sulphur dioxide molecule O—S—O has mass ratio M/m = 2 and an apex angle 
very close to 120°. Its infrared absorbtion wave numbers are found to be i = 1151 em", 
ie = 525cm7!, a = 1336 cm7!. Show that there is no value of ¢ that fits this data with 
reasonable accuracy. This is a deficiency of our simple (central force) model of interatomic 


forces, which gives poor results for V-shaped molecules (see Herzberg [13]). 


Chapter Sixteen 


Vector angular velocity 
and rigid body kinematics 


KEY FEATURES 
The key features in this chapter are vector angular velocity and the kinematics of rigid bodies 
in general motion. 


This chapter is concerned with the kinematics of rigid bodies in general motion. 
In Chapter 2 we considered only those rigid body motions that were essentially two- 
dimensional, and angular velocity appeared there as a scalar quantity. In general three- 
dimensional rigid body motion, this approach is no longer adequate and angular velocity 
must be introduced in its proper role as a vector quantity. The principal result of the 
Chapter is that any motion of a rigid body can be represented as a sum of translational 
and rotational contributions. 


16.1 ROTATION ABOUT A FIXED AXIS 


In this chapter we adopt a more rigorous approach to rigid body rotation than we 
did in Chapter 2. We begin with a proper definition of rigidity. 


Definition 16.1 Rigidity A body B is said to be a rigid body if the distance between 
any pair of its particles remains constant. That is, if P; and P; are typical particles of B 
with position vectors rj(t) and r ;(t) at time t, then 


Iri(t) —rj(t)| =cij, (16.1) 


where the cj; are constants. 


Suppose a rigid body B is rotating about a fixed axis with angular speed w. This 
motion certainly satisfies the rigidity conditions (16.1). Let 7 be a unit vector parallel to 
the rotation axis. Then the vector angular velocity of B is defined as follows: 


Definition 16.2 Vector angular velocity The angular velocity vector of the body B is 
defined to be 


@o=ton, (16.2) 


458 Chapter 16 Vector angular velocity and rigid body kinematics 


FIGURE 16.1 The rigid body 6 rotates with angular speed w about a fixed 
axis parallel to the unit vector 7. Its angular velocity vector is defined by 
@ = +7, where the sign is determined by the sense of the rotation. 


where the sign is taken to be plus or minus depending on whether the sense of the rotation 
(relative to the vector N) is right- or left-handed. These senses are shown in Figure 16.1. 


Note. From the vector w we can deduce the angular speed, the axis direction, and the 
rotation sense about the axis. It tells us nothing about the position of the axis either in 
space or in the body. 


Example 16.1 Calculation of w 


A rigid body 6 is rotating with angular speed 7 radians per second about a fixed axis 
through the points A(2, 3, —1), B(—4, 0, 1). The rotation is in the left-handed sense 
— 


relative to AB. Find the angular velocity of B. 


Solution 


The position vectors of the points A and B area = 2i+3 jy —kandb=—4i+kso 
that 


AB=b-a=-—6i—3j+2k. 


The vector 7 is then 
gq = Oi=3A+2K _ -61-3j +2 
i 7 ; 


The rotation sense is left-handed relative to the direction of m so that the angular 
velocity of B is 


@=-7Nn 


1 (Soe) 
7 7 


=6i+3j—2k radians per second. & 
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Particle velocities and accelerations 


The particle velocities can be conveniently calculated in terms of the vector w. Let P be 
a particle of B with position vector r relative to an origin O located on the rotation axis 
(see Figure 16.1). Then the velocity of P has the same direction as the vector wxr. Hence 
v can be written in the form 


v=i@xYr, 


where A is a positive scalar. To determine A, consider the magnitude of each side. The 
magnitude of v is the circumferential speed wp (see Figure 16.5) and so 


op =A\@xr|=Alo||r| sina = Aw (OP sina) 


= hap. 
Hence A = | and the formula for v is 
v=@xr. (16.3) 


This formula applies only when the origin of vectors lies on the rotation axis, but a 
more general result is easy to obtain. Let B be any fixed point on the rotation axis. Then 


—> 
the velocity formula (16.3) still holds if the position vector r is replaced by BP= r — BD, 
where BD is the position vector of the point B. Hence the general formula for the velocity 
of P is given by 


v=@x(r-—b) (16.4) 


where B is any point on the rotation axis. 


Example 16.2 Finding particle velocities and accelerations 


A rigid body is rotating with constant angular speed 7 radians per second about a 
fixed axis through the points A(2, 3, —1), B(—4, 0, 1), distances being measured in 


centimetres. The rotation is in the left-handed sense relative to AB. Find the instan- 
taneous velocity, speed, and acceleration of the particle P of the body at the point 
(=3, 3; 5). 


Solution 
The angular velocity of this body has been determined in the last example to be 


@=6i+3j—-—2k radians per second. 


The velocity of P can now be found using (16.4) with 


r=—3i+3j+5k and b=—4i+k. 
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This gives 


v= (61+3j-2HxG4+3j7 +44) 


ij k 
=|6 3-2 
13 4 


18i—26j+15k cms !. 


This is the instantaneous velocity of P. The speed is therefore |v| = 
(182 + (—26)2 + 152)" = 35 ems“! 

The acceleration of P can be found by differentiating the formula (16.4) with 
repect to ¢. This gives 


a= @x(r—b)+@x(r—b). 


But @ is known to have constant direction and magnitude and so w = 0. Also b=0 
since B is a fixed particle. This leaves 


a=oxr 

=@xv 

= (61+ 37 —2k)x (18i — 267 + 15k) 
i jk 

=|6 3-2 
18 —26 15 


= —7i-126j—210k cms ?.m 


16.2 GENERAL RIGID BODY KINEMATICS 


We now move on to the more general case in which the rigid body does not have 
a fixed rotation axis. We first consider a rigid body that has one particle O that does not 
move, and we take O to be the origin of position vectors. Now the rigidity conditions 
(16.1) are equivalent to 


(ri-ry):GWi-ry)= c, (16.5) 
and because O is a particle B it follows in particular that 
rier) =di, (16.6) 


where the d; are constants. On expanding the dot product in (16.5) and using (16.6) we 
obtain 


rjierj = eij, (16.7) 
where the e;; are constants. If we now differentiate (16.7) with respect to t we obtain 
rierjt+rerj =0 forall i, j, (16.8) 


which is our preferred form of the rigidity conditions. 
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We now prove the fundamental theorems of rigid body kinematics. The details of 
the proofs are mainly of interest to mathematics students. 


Theorem 16.1 Existence of angular velocity | Let a rigid body B be in motion with 
one of its particles O fixed. Then there exists a unique vector w(t) such that the velocity 
of any particle P of B is given by the formula 


v=@xr, (16.9) 


where r is the position vector of P relative to O. 


This result means that, at each instant, 6 is rotating about an instantaneous axis 
through O. This axis is not fixed in space or in the body. 


Proof. Suppose that there exist particles E;, E2, E3 of B such that their position vectors 
{e1, e2, e3} relative to O form a standard basis set. Then if there does exist an @ satisfying 
(16.9), it must in particular satisfy 


ee = WX ex (16.10) 
for k = 1, 2, 3. Taking the cross product of this equation with e; gives 


er X ee = CL X (WX ex) = (Ex + Cx) W — (W+ Ex) eK 


= W— (@+ ek) e& 
since eg is a unit vector. Summing these equations over | < k < 3 gives 


So ex xe = 3@— ° + ex) ex = 30-0 
k k 


since the sum on the right is just the expansion of w with respect to the basis set {e1, e2, e3}. 
Hence 


1 
eee (16.11) 


Thus if @ does exist, it must be given by the formula (16.11), which shows that @ is unique. 
We must now show that this @ satisfies (16.9) for all the particles of the body. It is a simple 
exercise to verify that this is true for the particles E,, E2, E3 by substituting (16.11) into 
(16.10) and using the rigidity conditions (16.8). Now let P be any other particle of the body 
and expand its position vector r with respect to the basis set {e, e2, e3} in the form 


r=) (r-e)e. 
k 


The velocity of P is then given by 


vara) (Feegtr-e&)ex + >) (r-ex) ee. 


k k 
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Now r-e, +r e = 0 by the rigidity conditions (16.8), and we have directly verified that 
eé; = wx e;. Hence 


v=) (r+ ex) (@xex) = ox) (r+ ex) e 
k k 
=oxr 


as required. i 


The above proof cannot even begin if any of the particles E,;, E>, E3 are not actually present (for example 
the body could be a lamina). In such a case, suppose that the body has at least two particles A, B in addition 
to O, and that O, A, B are not collinear. Then define the standard basis set {e;, e2, e3} by 


a b-(a-b)a 


1 Tal Tb (@- Bal’ 


e3 =e] Xe2. 


It can be shown that the points of space E, Ez, E3 that have the position vectors e;, e2, e3 satisfy the same 
rigidity conditions as the real particles of the body. They can therefore be regarded as real particles and the 
proof given above then holds. 


We now extend the result in Theorem 16.1 to the case of completely general motion 
in which no particle of the body is fixed. 


Theorem 16.2 Existence of angular velocity Il Suppose a rigid body is in completely 
general motion and let B be any one of its particles. Then there exists a unique vector @(t) 
such that the velocity of any particle P of the body is given by the formula 


v=v? + wx(r—b), (16.12) 


where r and bare the position vectors of P and B, and v® is the velocity of B. 


Proof. We view the motion of B from a reference frame F’ with origin at B and moving without rotation 
relative to the original frame F. Then (see section 1.4) the position vector r’ and velocity v’ of a particle P 
relative to F’ are related to the original r and v by 


r—r +b, v= +08, (16.13) 
It follows that if P; and P; are particles of B 
In, Flair rl sci. 


where the c;; are constants, so that the rigidity conditions are also satisfied in F ’. But in F’ the particle B 
is fixed (at the origin) and so Theorem 16.1 applies. Hence there exists a unique vector w(t) such that, for 
any particle of 6, 


v =oxr'’. (16.14) 
On using (16.13) into (16.14) we obtain 
v—vF = ox(r—b), 
that is 
v=v? +@x(r—b), (16.15) 


as required. Hi 


What the last theorem shows is that any rigid body motion can be resolved into a 
translation with velocity v? and a rotation with some angular velocity @ about an axis 
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FIGURE 16.2 A rigid body in general motion. 

The velocities of its particles are the sum of 

those due to a translation with velocity v? 

and a rotation with angular velocity @ about 

an axis through B. 1 


through B, where the particle B can be any particle of the body. This is shown in Figure 
16.2. 

Now suppose we choose a new reference particle C. Then the translational velocity 
would become v©, but what happens to the angular velocity @? The answer is nothing; the 
angular velocity @ is independent of the choice of reference particle. This means that we 
can refer to the angular velocity of a rigid body without specifying the reference particle. 


The proof of this is as follows: 
Proof. Suppose that, with reference particles B, C, the body has angular velocities oF w respectively. 


Then the velocity v of any particle P of the body is given by either of the two formulae 


v? + x(r—b), 
v= vo +@° x(r—o), 


v 


where r is the position vector of P. It follows that 

ve +08 x(r—b) =v +. 0% x(r—c) (16.16) 
for any r that is the position vector of a particle of the body. In particular, since B and C are particles of the 
body, it follows that 


ve = 4.0% x(b— 0), 


v? +8 x(e—b) = vf, 
and if we now subtract each of these formulae from the equality (16.16), we obtain 
xx(r—b)=0, 
xx(r—c)=0, 


B 


where x = w® — w©. Now let P be any particle of the body not collinear with B and C and suppose that 


x # 0. Then x must be parallel to both of the vectors r — b and r — c, which are not parallel to each other. 


B 


This is impossible and so x = 0, which means that w” = w°. Hence the angular velocity of the body is 


the same, irrespective of the choice of reference particle. Hi 
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A Ww 


FIGURE 16.3 The ball rolls with velocity Vi and has angular velocity w. 


Our results are summarised below: 


Suppose a rigid body is in general motion and that B is one of its particles. Then the 
velocity of any particle P of the body is given by the formula 


where v? is the velocity of the reference particle B and the angular velocity @ is 
independent of the choice of reference particle. 


Particle velocities in a rigid body 


v=v? + wx(r—b), (16.17) 


Example 16.3 Rolling snooker ball 


A rigid ball of radius } rolls without slipping on a horizontal table. Find the most 
general form of @ consistent with the rolling condition. 


Solution 

Suppose the ball is rolling with velocity Vi (where i is a horizontal unit vector), and 
has an unknown angular velocity w. Then, on taking the centre C of the ball as the 
reference particle, the velocity of any particle P is given by (16.15) to be 


v=Vi+ox(r—c). 
In particular, the velocity of the contact particle Q is given by 
v2 =Vi+@x(-bk), 


where the unit vector k points vertically upwards. Since the rolling condition requires 
that v2 = 0, it follows that @ must satisfy the condition 


Vi+bkxw=0. (16.18) 
On taking the cross product of this equation with k, we obtain 


Vkxi+bkx(kxo) =0, 
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that is 
Vit+b(@-kk—(k- ko) = 0. 


Since k is a unit vector, k - k = 1 and we obtain 


V 


It follows that any » consistent with the rolling condition must have the form 


o= Ti tak (16.19) 
where A is a scalar function of the time. Conversely, it is easy to verify that the formula 
(16.19) for @ satisfies the rolling condition (16.18) for any choice of the scalar A. This 
is therefore the most general form of @ consistent with rolling. 

This result is surprising at first. If the motion were planar, the value of @ would 
be V /b, the corresponding w being (V/b)j. This is the special case 4 = 0. But in 
general three dimensional rolling, A 4 0 and the rotation axis is not horizontal. This 
effect is well known to pool and snooker players and is achieved by striking the ball 
to the right (or left) of centre, thereby giving 4 a positive (or negative) value. Players 
call this putting ‘side’ on the ball. It makes no difference to the rolling but affects the 
bounce when the ball strikes a cushion. 


Time to relax Find a pool table and experiment by striking a ball slowly but firmly well 
to the right of centre. The marking on the ball should enable you to ‘see’ the rotation axis 
(a ball with spots is best). Check that giving the ball ‘right hand side’ produces a positive 
value of @ + k. 


Example 16.4 Wheel rolling around a circular path 


A circular wheel of radius 5 has its plane vertical and rolls with constant speed V 
around a circular path of radius R marked on a horizontal floor. Find the angular 
velocity of the wheel and the acceleration of the contact particle. 


Solution 


Suppose we view the motion of the wheel from the rotating reference frame 
{O; i, j, k} shown in Figure 16.4. This frame rotates about the axis {O, k} with 
angular velocity & given by 


Q=6k= . k. (16.20) 


Viewed from the rotating frame, the wheel is rotating about a fixed axis parallel to 
the vector i. On applying the rolling condition, the angular velocity w’ of the wheel, 
viewed from the rotating frame, is given by 


i. (16.21) 
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Vj 


FIGURE 16.4 A wheel of radius b rolls around a circle of radius 
R. The unit vectors i(t) and j(t) follow the wheel as it moves 
around the circle; k is a constant unit vector. 


The true angular velocity w of the wheel is then given by the sum 
oa=o0 +2. (16.22) 


Here we are using a result from Chapter 17, the angular velocity addition theorem; 
it is the rotational counterpart of the addition theorem for linear velocities that we 
obtained in Chapter 2 . Although we have yet to prove this result, it is clear enough 
what it means and we will use it anyway! On substituting 16.20) and (16.21) into 
(16.22) we find the angular velocity of the wheel to be 
V V 
= ——i+ —k. 16.23 
@ a! + ; ( ) 
Now for the particle accelerations. These can be found by differentiating the 
velocity formula 


v=Vjtox(r—o), (16.24) 


with respect to t, where we have taken C, the centre of the wheel, as the reference 
particle. This gives 


di 
a=V— +ax(r—e) + ox 6) 
dj. , 

i ae ee =e Nd); 


since r = v and ¢ = Vj. In particular, since r2 = c — bk and v2 = 0, the 
acceleration of the contact particle Q is given by 
0 dj. ; 
ax =V a + @x(—bk) +@x(-VJ) 


dj di y? v2 
a yp ep pe (16.25) 


on using the formula (16.23) to replace and o. 
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The only unknown quantities left are di/dt and dj /dt. However, the vectors i, 
j correspond precisely to the polar unit vectors 7, 6 treated in section 2.3, from which 
we deduce that 


dj _ 


di .. Vv 
=6j=—j and = 
or ae. 


a 
On substituting these formulae into equation (16.25) we obtain 


v2 v2 
a ees 
a Ro S 


as the acceleration of the contact particle Q. & 


Problems on Chapter 16 


Answers and comments are at the end of the book. 


Harder problems carry a star (*). 


16.1 A rigid body is rotating in the right-handed sense about the axis Oz with a constant 
angular speed of 2 radians per second. Write down the angular velocity vector of the body, 
and find the instantaneous velocity, speed and acceleration of the particle of the body at the 
point (4, —3, 7), where distances are measured in metres. 


16.2 A rigid body is rotating with constant angular speed 3 radians per second about a fixed 
axis through the points A(4, 1, 1), B(2, —1, 0), distances being measured in centimetres. The 


rotation is in the left-handed sense relative to the direction AB. Find the instantaneous velocity 
and acceleration of the particle P of the body at the point (4, 4, 4). 


16.3 A spinning top (a rigid body of revolution) is in general motion with its vertex (a particle 
on the axis of symmetry) fixed at the origin O. Let a(t) be the unit vector pointing along the 
axis of symmetry and let w(t) be the angular velocity of the top. (In general, @ does not point 
along the axis of symmetry.) By considering the velocities of particles of the top that lie on 
the axis of symmetry, show that a satisfies the equation 


a=oxa. 
Deduce that the most general form can have is 
@=axat+da, 


where A is a scalar function of the time. [This formula is needed in the theory of the spinning 
top.] 


16.4 A penny of radius a rolls without slipping on a rough horizontal table. The penny rolls 
in such a way that its centre G remains fixed (see Figure 16.5). The plane of the penny makes a 
constant angle a with the table and the point of contact C traces out a circle with centre O and 
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FIGURE 16.5 A penny of radius 5 rolls ona 
rough horizontal table in such a way that its 
centre G remains fixed. 


radius a cos a@, as shown. At time f, the angle between the radius OC and some fixed radius is 
@. Find the angular velocity vector of the penny in terms of the unit vectors a(t), k shown. 
Find the velocity of the highest particle of the penny. 


16.5 A rigid circular cone with altitude / and semi-angle a rolls without slipping on a rough 
horizontal table. Explain why the vertex O of the cone never moves. Let @(t) be the angle 
between OC, the line of the cone that is in contact with the table, and some fixed horizontal 
reference line O A. Show that the angular velocity w of the cone is given by 


o=— (6 cot a) i, 


where i(f) is the unit vector pointing in the direction OC. [First identify the direction of @, 
and then consider the velocities of those particles of the cone that lie on the axis of symmetry. ] 
Identify the particle of the cone that has the maximum speed and find this speed. 


16.6% Two rigid plastic panels lie in the planes z = —b and z = 5 respectively. A rigid ball 
of radius b can move in the space between the panels and is gripped by them so that it does not 
slip. The panels are made to rotate with angular velocities wk, w2k about fixed vertical axes 
that are a distance 2c apart. Show that, with a suitable choice of origin, the position vector R 
of the centre of the ball satisfies the equation 


R=QxR, 


where 2 = 5(@ 1 + @2). Deduce that the ball must move in a circle and find the position of 
the centre of this circle. [This arrangement is sometimes seen as a shop window display. The 
panels are transparent and the ball seems to be executing a circle in mid-air. ] 


16.7 Two hollow spheres have radii a and b (b > a), and their common centre O is fixed. A 
rigid ball of radius 5(b — a) can move in the annular space between the spheres and is gripped 
by them so that it does not slip. The spheres are made to rotate with constant angular velocities 
@ 1, @2 respectively. Show that the ball must move in a circle whose plane is perpendicular to 
the vector a@, + bo. 


Chapter Seventeen 
| 


Rotating reference frames 


KEY FEATURES 
The key features of this chapter are the transformation of velocity and acceleration between 
frames in general relative motion, and the dynamical effects of the Earth’s rotation. 


So far we have viewed the motion of mechanical systems from an inertial reference 
frame. The reason for this is simple; the Second Law, in its standard form, applies only in 
inertial frames. However, circumstances arise in which it is convenient to view the motion 
from a non-inertial frame. The most important instance of this occurs when the motion 
takes place near the surface of the Earth. Previously we have argued that the dynamical 
effects of the Earth’s rotation are small enough to be neglected. While this is usually true, 
there are circumstances in which it has a significant effect. In long range artillery, the 
Earth’s rotation gives rise to an important correction, and, in the hydrodynamics of the 
atmosphere and oceans, the Earth’s rotation can have a dominant effect. If we wish to cal- 
culate such effects (as seen by an observer on the Earth), we must take our reference frame 
fixed to the Earth, thus making it a non-inertial frame. The downside of this choice is that 
the Second Law does not hold and must be replaced by a considerably more complicated 
equation. 

In addition to applications involving the Earth’s rotation, there are instances where 
the motion of a system looks much simpler when viewed from a suitably chosen rotating 
frame. The Larmor precession of a charged particle moving in a uniform magnetic field 
is one example; a second is the motion of a rigid body relative to its own principal axes of 
inertia, which leads to Euler’s equations. 


17.11 TRANSFORMATION FORMULAE 


In this section we derive the transformation formulae that link the velocity and 
acceleration of a particle measured in a moving frame, with the same quantities measured 
in a fixed frame. For the purposes of kinematics, the labels ‘fixed’ and ‘moving’ are 
arbitrary. Each frame is moving relative to the other and the labels could be reversed; we 
use them purely for convenience. In dynamics however the distinction between the fixed 
and moving frames is real. The ‘fixed’ frame is an inertial frame, in which Newton’s laws 
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a 


B! 


FIGURE 17.1 The moving frame F’ = {O'; e/, e/, e4} has translational velocity 
V and angular velocity & relative to the fixed frame F = {O; e1, e2, e3}. 


apply, and the moving frame is a non-inertial frame, in which Newton’s laws do not apply, 
at least in their standard form. 

Let F = {O; e}, €2, €3} be the fixed frame and F’ = {O’; e', e4, e4} be the moving 
frame, as shown in Figure 17.1. At time r, the frame F’ has translational velocity* V and 
angular velocity @ relative to the frame ¥. It is convenient to regard the moving reference 
frame F’ as being embedded in a rigid body 6’ with reference particle O’. Then V and 
Q are the translational and angular velocities of the body 6’. The most important feature 
of reference frames in relative motion is this: 


The rate of change of a vector quantity measured in the frame F is generally not the 
same as the rate of change of the same quantity measured in the frame F’. 


Question Why are there different rates of change? 


Suppose u(t) is a vector quantity. Why should it have different rates of change when 
measured in the frames F and F’? 


Answer 


Suppose the expression for u in terms of the basis set {e;, e2, e3} is 

u=u,e; +u2e2 +:13e3, (17.1) 
and in terms of the basis set {e', e4, €3} is 

u=ue, +u,e, +u5e. (17.2) 


In general, the components {u1, u2, v3} and {u',, u, u4} will be functions of the time 
ie 


* This means that the origin O’ has velocity V relative to F. The angular velocity @ is independent of the 
choice of O’. 
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In the rate of change of u measured in F, the basis set {e1, e2, e3} is, by defini- 
tion, constant so that 


du : ‘ i 
@ =e; +u2e0 +1 @3. (17.3) 
dt) ¢ 


In contrast, in the rate of change of u measured in F’, the basis set {e', e4,, e4} is, by 
definition, constant so that 


d 
(=) aii aietae (17.4) 
dt} ¢ 


Note that the components {u,} and {uj} are scalar functions of the time and so their 
rates of change are independent of the reference frame. This is why we do not need to 
label them as being observed in F or F’. There is no reason why the two expressions 
(17.3) and (17.4) should be equal and, in general, they are not equal. Consider, for 
example, the case in which u is constant in F' so that (du/dt)¢ = 0. However, the 
motion of F’ relative to F means that u will not be constant in F and (du/dt)¢ will 
not be zero. 


True and apparent values 


In order to simplify the writing, we will, from now on, refer to the value of a quantity 
measured in the fixed frame F¥ as its true value, and the value measured in the moving 
frame F’ as its apparent value. For example, (du/dt) + will be referred to the true value 
of du/dt, while (du/dt) + will be referred to as its apparent value. 


Rates of change of the basis vectors {e’', e’,, e'} 


Our first step is to find the rates of change of the fundamental basis vectors {e', e5, e3} 
belonging to the frame F’. Since these vectors are, by definition, constant in F’, their 
apparent rates of change are zero. What we need to find are their true rates of change. 

Let E be any particle of the body B’ in which the frame F’ is embedded, and let e 
and e’ be the position vectors of E relative to O and O’ respectively. Then, by the triangle 
law, 


e=D+eé, 


where D is the position vector of O’ relative to O. Then 


de dD de’ 
(x), - (Sr),+ (a), ple 
de’ 
=V+ (=) ; (17.6) 
F 


where V is the true velocity of O’ relative to O. But (de/dt)F is, by definition, v=, the 
true velocity of the particle E, and, since E is a particle of the rigid body B’, v” is given 
by 


vi =V+Q2xe', (17.7) 
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on using the kinematical formula (16.17). On comparing the equations (17.6) and (17.7), 


‘A 
(=) =Qxe’. 
dt ) xr 


This result applies to any vector e’ that is the position vector (relative to O’) of a particle 


we see that 


of the rigid body B’. In particular, since the basis vectors {e',, e5, e} can be regarded as 
the position vectors of particles of B’, we obtain the fundamental relations 


de’. 
—}] =axe’ (l= 7 = 3). 
F 


The notation (d e’. /dt) for the true rate of change of the vector e’, is accurate but cum- 
bersome. Since the apparent rate of change of these vectors is zero, there is little chance 
of confusion if, from now on, we replace (de’, i dt)# by the simple notation é . Our result 
can then be expressed in the form: 


True rates of change of the basis vectors {e’/,, e’, e3} 
(17.8) 


@=Oxe, @)=Oxc 8 =2xe;, 


where @ is the angular velocity of the frame F’ relative to the frame F. 


Relation between the true and apparent values of du/dt 


Our next step is to find the relationship between the true and apparent values of du/dt, 
where uw is any vector function of the time. To do this we differentiate the representation 
(17.2) with respect to t while keeping the basis set {e1, e2, e3} constant. This gives 


du d(u\e}) d(u5e) ae) 
GE -( dt ere dt el dt Jr 


= (ae, +e, + 5e5) tule +u5e +u5e 


tu) (Qxe}) +5 (@x es) +.u5 (Qxe5) 


+ Qx ( (u' cA + use +u5e) 


(2), 0 
) 


F’ 


t 


Q 
(is) + Qxu. 


The true and apparent values of du/dt are therefore related by the formula: 
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Transformation formula for du/dt 
(=) (=) (17.9) 
—)}] =(— + Qxu 
di je dt) es 


where @ is the angular velocity of the frame F’ relative to the frame F. 


Velocity transformation formula 


It is now easy to find how particle velocities transform between the two frames. Suppose 
a particle P has position vector r relative to F and position vector r’ relative to F’. It 
follows from the triangle law that 


P= De", (17.10) 


where D is the position vector of O’ relative to O. Then 


(z _ =) + dr’ 
dt}, \dt}e \at} 


v+(2) 
v= —), 


where v is the true velocity of P. From the transformation formula (17.9), it follows that 


/ / 
(),-(ip),0 
at ge dt J zi 


=v+Qxr’, (17.11) 


that is, 


where v’ is the apparent velocity of P. On combining these results, we obtain: 


Velocity transformation formula 
(17.12) 
v=V+Qxr+d 


This is the velocity transformation formula connecting the true and apparent velocities 
of the particle P. 

This formula has a simple interpretation. The true velocity v is the sum of (i) the 
apparent velocity v’, and (ii) the velocity V + & xr’ that is given to the particle by the 
motion of the frame F’. 

An immediate consequence of the velocity transformation formula is the correspond- 
ing transformation rule for angular velocities. 
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Fixed cone Rolling cone 


FIGURE 17.2 The dark shaded cone is fixed and the light 
shaded cone rolls around it. The vertex of each cone is at 
rest at O. 


Theorem 17.1 Addition theorem for angular velocities Suppose a rigid body in 
general motion has true angular velocity @ and apparent angular velocity @'. Then @ 
and ! are related by the addition formula 


o=2+a, (17.13) 


where & is the angular velocity of the frame F' relative to F. [This result is the counter- 
part of the velocity transformation formula v = V + v’ for translating frames.] 


The proof of this theorem is the subject of Problem 17.2. We have already used this 
result in Chapter 16, Example 16.4, and we give a second example now. 


Example 17.1 Cone rolling on another cone 


A circular cone with semi-angle 45° is fixed with its axis of symmetry vertical and 
its vertex O upwards. An identical cone also has its vertex fixed at O and rolls on 
the first cone so that its axis of symmetry precesses around the upward vertical with 
angular speed A as shown. Find the angular speed of the rolling cone. 


Solution 

Let F be a fixed frame and F’ be a frame with origin at O and precessing with the 
rolling cone. Then F’ has angular velocity 1k relative to F, where the unit vector k 
points vertically upwards. In the frame F’ the second cone has its axis of symmetry 
fixed and the first cone has angular velocity —A k. The rolling contact between the 
two cones means that the second cone must be rotating about its axis of symmetry 
{O, n} with angular speed A in the negative sense. Its angular velocity is therefore 
—An. In the notation of Theorem 17.1 we therefore have Q = Ak and w = —An. It 
follows that the angular velocity of the rolling cone is 


@=Q2+4+o' =Ak—An. 


The angular speed of the rolling cone is therefore 2). ml 
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Acceleration transformation formula 


From the velocity transformation formula (17.12), it follows that 


(=) (=) (F) ; (=) (=) 
—) =(—] +{—] xr4+Qx(—]} +{—] . 
dt) ¢ dt} ¢ dt Jr dt Jz dt Jr 


ey ar’ dv’ 
a=A+Qxr4+Q2x{—) +(|—] , (17.14) 
F \dt/¢ 


that is, 


dt 


where a is the true acceleration of the particle P, and A, & and Q are the translational 
acceleration, the angular velocity, and the angular acceleration of the frame F’, all relative 
to the frame F. 

The derivative (d r'/d t) £ has already been evaluated in (17.11), and the derivative 
(d v'/ dt) 1s given by the transformation formula (17.9) to be 


dv’ dv’ 
ara 
dt Jr dt Jr 
=a'+ Qxv'’, (17.15) 
where a’ is the apparent acceleration of P. On combining these results, we obtain 
a=A+2xr'+2x(v'+2xr') + (a+ 2xv’) 


which, on simplification, gives the acceleration transformation formula: 


Acceleration transformation formula 
(17.16) 
a=A+Q2xr'+2Qxv + &x(Qxr) +a’ 


where A is the translational acceleration, @ is the angular velocity and Q is the angular 
acceleration of the frame F’, all relative to the frame F. 

The acceleration formula does not have a simple interpretation. The apparent accel- 
eration is a’ and the acceleration given to the particle by the motion of the frame F’ is 
A+Q2xr'+Qx (Q xr’). However, the true acceleration is not the sum of these two 
contributions! There is the additional term 2 & x v’ that appears only when the particle is 
moving within the moving frame. This Coriolis* term, as it is called, is hard to explain 
physically. 


* After Gaspard-Gustave de Coriolis (1792-1843) whose paper on the subject was published in 1835. He 
also introduced the notion of ‘work’ with its present scientific meaning. 
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Centrifugal force 
—m2 x(Qxr’) 


FIGURE 17.3 Man walking on a rotating roundabout. 


17.2 PARTICLE DYNAMICS IN A NON-INERTIAL FRAME 


Suppose now that F is an inertial frame. Then the equation of motion of a 
particle P of mass m moving under the force F is simply given by the Second Law 


ma=F, (17.17) 
where a is the true acceleration of P. It follows from the acceleration transformation 


formula (17.16), that, when the same motion is observed from the frame F’, equation 
(17.17) is replaced by the transformed equation of motion 


Second Law in a general non-inertial frame 
(17.18) 


m|A+2xr! +2Qxv' + 2x (@xr') +a] = F. 


Here r’, v’, a’ are the apparent position, velocity and acceleration of P, and A, & and Q 
are the translational acceleration, angular velocity and angular acceleration of the frame 
F' relative to the frame F. 

This is the form taken by the Second Law in a general non-inertial frame. Note that 
there is no new physics in this equation. It is simply the Second Law written in terms of 
the apparent quantities observed in the non-inertial frame F’. 


Example 17.2 Man walking on a rotating roundabout 


A fairground roundabout is made to rotate with constant angular speed Q about its 
axis of symmetry which is fixed in a vertical position, the sense of the rotation being 
as shown in Figure 17.3. A man is on the roundabout and is walking grimly towards 
the centre O’ along the moving radius A’O’ with constant speed V. What is the force 
that the roundabout exerts on the man? 
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Solution 
Let the frame F = {O; e1, e2, e3} be an inertial frame of reference attached to the 
ground and the frame F’ = {O’; e', e4, e4} be attached to the roundabout, as shown 
in Figure 17.3. Then F’ has zero translational velocity and constant angular velocity 
Q (= Qe3) relative to F. Hence A = 0 and Q = 0 in this problem. 

In the frame F’, the motion of the man is uniform and rectilinear so that r’ = 
x, e, v’ = —Vel anda’ = 0. The transformed equation of motion (17.18) therefore 


reduces to 
m [2 (8e3) x (—V e}) + (Qe3) x ((Qe3) x (x1 e)) |] = —mge3 + X, 
where X is the force that the roundabout exerts on the man. Hence, X is given by 
X = (mg)e3 — (2mQV) e4 _ (mQ?x\)e}. 


Because of the additional terms* in this expression, the man must lean forwards and 
to his left; otherwise he will fall over. 


Fictitious forces 


Equation (17.18) can be made to resemble the standard form of the Second Law by the 
simple device of transferring the four new terms on the left to the right side of the equation 
and regarding them as fictitious forces that act on P in addition to the real force F. This 
gives 


mal = F + (—mA) + (—mQxr’) + (-2mQ2xv’) + (-mQx(@xr’)), 
—"—-__ —-e-—@r oe 
Coriolis force centrifugal force 


(17.19) 


which we will call the fictitious force equation. 

In other words, the fact that the frame F’ is non-inertial may be ignored provided that 
the four fictitious forces —-mA, —mQ2 xr’, -2mQ~x v' and —mQ x (@ xr’) are added to 
the real force F. Two of these ‘forces’ have names. The ‘force’ —2m Q x v’ is called the 

Coriolis force and the ‘force’ —m 2 x (Qxr’) is called the centrifugal force.’ In the 
problem of the man on the rotating roundabout, only the Coriolis and centrifugal ‘forces’ 
are non-zero and their directions are shown in Figure 17.3. 

The transformed equation of motion and the fictitious force equation are obviously 
equivalent and there is no logical need to introduce the notion of fictitious forces at all. 
However, the fictitious force approach is often useful when looking at a problem qualita- 
tively. For example, in the problem of the man walking on the roundabout, the fictitious 
force description would be as follows: 


* Tf the roundabout were at rest, then X would be simply mg e3. 
+ Although the name ‘centrifugal force’ for the expression —m Q x (Q x r’) is standard in the literature, it 
is not really appropriate unless O’ = O, that is, the two frames of reference have a common origin. 
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FIGURE 17.4 The inertial frame F = {O; e1, e2, e3} is attached to the 
surface of the Earth. 


‘Forget that the roundabout is rotating, and introduce the Coriolis and centrifugal 
forces as shown in Figure 17.3. The man will then feel the Coriolis force pushing him 
to his right and the centrifiugal force pushing him backwards. This is why he must lean 
forwards and to his left’ . 


Such arguments are fine at the qualitative level, where they provide a picture of what 
is going on. However, it is asking for trouble to have two parallel approaches available 
when solving problems. A choice must be made and, in this book, we will always solve 
problems in rotating frame dynamics by using the transformed equation of motion (17.18); 
ficticious forces will only be mentioned in passing. 


17.3. MOTION RELATIVE TO THE EARTH 


The most common non-inertial reference frame is the Earth. The effects of the 

Earth’s motion are generally small and we have so far completely neglected them when 

solving problems in dynamics. However, there are circumstances in which the Earth’s 

motion has a significant effect. In long range artillery, the Earth’s rotation gives rise to 

an important correction,* and, in the hydrodynamics of the atmosphere and oceans, the 
Earth’s rotation can have a dominant effect. 

The Earth moves in its orbit around the Sun and also rotates on its axis. The effect of 

its orbital motion is very much smaller than the effect of its rotation and we will neglect 


* Artillery shells are deflected to the right in the northern hemisphere and to the /eft in the southern. The 
following anecdote is due to the great English mathematician J.E. Littlewood. ‘I! heard an account of the 
battle of the Falkland Islands (early in the 1914 war) from an officer who was there. The German ships 
were destroyed at extreme range, but it took a long time and salvos were continually falling 100 yards to 
the left. The effect of the rotation of the Earth was incorporated into the gun-sights. But this involved the 
tacit assumption that Naval battles take place round about latitude 50°N, whereas the Falkland Islands 
are at about latitude 50°S. At extreme range, this double difference is of the order of 100 yards! 
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the orbital motion of the Earth altogether. We will regard the Earth as an axially symmetric 
rigid body* whose axis of symmetry is at rest in an inertial frame Z. Previously we have 
denoted the moving frame by F’, but it would be insufferable to attach a dash to every 
quantity from now until the end of the chapter. We will therefore drop the dashes and, 
from now on, we will let F = {O; i, j, k} be a non-inertial frame attached to the surface 
of the Earth, as shown in Figure 17.4. The orientations of the basis vectors {i, 7, k} will 
be described later. Because of the Earth’s rotation, the frame F has an acceleration A and 
an angular velocity @ relative to the inertial frame Z. The acceleration A is caused by the 
circular motion of the origin O around the Earth’s rotation axis and acts towards the axis. 
The angular velocity Q is the same as the angular velocity of the Earth itself; it is therefore 


parallel to the rotation axis and constant. If n is the unit vector in the direction SN , where 
S and N are the north and south poles, then Q = + Qn, where Q is 27 radians per day,’ 
or about 0.000073 radians per second. 

Let P be a particle of mass m and position vector r relative to O. Then the trans- 
formed equation of motion for P is 


m[A+22xv+2x(Qxr)+al =F. (17.20) 


Here r, v and a are the apparent position, velocity and acceleration of P, and F® is the 
gravitational force exerted on P by the Earth. There may be other forces acting on P, but, 
for the moment, we suppose that the only force is the Earth’s gravity. 

Suppose first that P is released from rest (in F) at the point with position vector r. 
Then its initial acceleration ag is given by equation (17.20) to be 


aj =m !F’ —~A-&x(Qxr). (17.21) 


If the Earth were not rotating, this acceleration would be the acceleration due to the Earth’s 
gravity at the point r. With rotation present, it is still the observed acceleration of a 
body released from rest, but it is not caused by gravity alone. We call this the apparent 
gravitational acceleration (at the point 7). The direction opposite to ao is the apparent 
vertical direction. Let k (= k(r)) be the unit vector in the apparent vertical direction. 
Then a! can be written in the form 


ayo = —g(r)k(r), (17.22) 


where g is the magnitude of the apparent gravitational acceleration. These values of g 
and k differ slightly (by less than 1%) from the corresponding quantities due to Earth’s 
gravity alone. In terms of the quantities g and k, the transformed equation of motion for 
P simplifies to give 

dv 


m E +29x0| = —meg(r)k(r). 


* We are not restricted to regard the Earth as spherically symmetric. 
* Actually, the Earth revolves not once every day, but every 23 h 56 m. Why? 
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As in the non-rotational theory, we will usually suppose that the extent of the motion of P 
is small compared to the Earth’s radius so that the spatial variations of g(r) and k(r) can 
be neglected. In this approximation, the transformed equation of motion for P becomes 


Projectile equation on a rotating Earth 


ai (17.23) 
m at + 2Q2xv| =-—mgk 


where g and k are now constants. This is the projectile equation for a rotating Earth. If 
any other forces act on P, they should be included on the right side of this equation. Note 
that, by introducing apparent gravity, this equation differs from the standard projectile 
equation only by the appearance of the Coliolis term. 

We will now assign directions to the basis vectors {i, 7, k} of our frame F. The basis 
vector k is taken to be the same as the apparent upward vertical vector k (now assumed 
constant); the vector i is taken so that the Earth’s rotation axis lies in the plane Oxz (see 
Figure 17.4); the vector j is then determined. In geographical language, i points south 
and j points east. With these choices, the Earth’s angular velocity is then 


Q = Q(—sin Bi + cos Bh), (17.24) 


where # is the angle between the apparent vertical at O and the Earth’s rotation axis. 6 
is approximately 90° — 6, where @ is the geographical latitude of O. [Why only approxi- 
mately?] Despite this small discrepancy, we will call the angle 6 the co-latitude of O. 


Example 17.3 Tower of Pisa experiment 


A lead ball is dropped from the top of the tower of Pisa. Show that its path deviates 
from that of a plumbline and find where it lands. [Neglect air resistance but include 
the effect of the Earth’s rotation.] 


Solution 

Consider first the plumbline. The bob is subject to the apparent gravity force —mgk 
and the string tension force 7. When the plumbline is in equilibrium it follows from 
equation (17.23) that 0 = —mgk + T, that is, T = mgk. Hence the tension in the 
plumbline is mg and the string is parallel to the apparent vertical. 

Now consider the ball. Take the origin O of coordinates at the point where the 
ball is released and suppose that the apparent vertical through O meets the ground at 
the point (0,0, —h). The equation of motion for the ball is the projectile equation 
(17.23), together with the initial conditions v = 0 and r = 0 when t = 0. Since @ is 
constant, equation (17.23) can easily be integrated once with respect to ¢ to give 


dr 
pt ee See 
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where C is the constant of integration. The initial conditions then imply that C = 0 
so that the displacement of the ball satisfies the first order equation 

dr 

—+2Qxr=-—gtk (17.25) 

dt 
with the initial condition r = 0 when t = O. Since we expect the effect of the 
Earth’s rotation to be a small correction, we will solve this equation approximately 
by an iterative method. On integrating equation (17.25) with respect to ¢ and using 
the initial condition, we find that the unknown displacement r(¢) satisfies the integral 
equation* 


t 

7O)= —}er7k- 29x f r(t’)dt’. (17.26) 
0 

(The variable of integration has been changed to the dummy variable ¢’ because t¢ 

appears as a limit of integration.) We now solve this equation approximately by iter- 

ation. The zeroth order approximation r) corresponds to the case Q = 0, that is, the 

case of the non-rotating Earth. Thus the zeroth order approximation is 


r = —J 90k, (17.27) 


which is just the elementary solution for vertical motion under uniform gravity. The 
first order approximation r“ is now obtained by substituting the zeroth order approx- 
imation into the integral term in equation (17.26). This gives 


¢ 
r) = her k— 20% | rw) dt! 
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= —Agt?k + tet (Qxk). 


Now the angular velocity & is given by equation (17.24), where f is the co-latitude of 
the point O. It follows that Qxk = (Q sin B) j and so the first order approximation 
for the displacement r(f) is given by 


r) = —S et? k + det*(Qsin B) j. (17.28) 


Higher approximations can be obtained in the same way, but they are progressively 
less important’ (and more complicated). The first order appoximation predicts that 


* Any equation in which an unknown function appears under an integral sign is called an integral equation. 
There is an extensive theory of such equations. Integral equations like (17.26) have the impressive title 
of Volterra integral equations of the second kind. Their most important feature is that they can always be 
solved by the iteration process described. 

* The iterative approximation scheme described above will converge rapidly when the dimensionless prod- 
uct Qt is small, where Tt is the total travel time. (Qt is the angle through which the Earth rotates during 
the motion.) In the Tower of Pisa problem, the value of Qr is about 0.0002, which explains why the 
correction is so small. 
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FIGURE 17.5 Left: The Foucault pendulum is an ordinary pendulum suspended 
from a point O fixed to the Earth. Right: The precessing frame OXY Z rotates 
around the axis Oz with angular velocity —(Q cos B)k. 


the ball will not travel down the apparent vertical, but will be displaced in the positive 
j-direction, that is, to the east. It remains to determine this easterly displacement in 
terms of h, the height of the tower. For this we need the total travel time t, but it 
is consistent with the first order approximation to determine t from the zero order 
theory which gives t = (2h/g)'/*. Hence, because of the Earth’s rotation, the ball 
does not travel down the apparent vertical exactly but drifts to the east a distance 


gn3\/ 
(=) Q sin B. 
& 


For a ball dropped from the top of the Tower of Pisa, which is about 50 m high and 
is at latitude 44°N, this easterly drift is about 5.6 mm. 


Example 17.4 The Foucault pendulum 


How does the Earth’s rotation affect the small oscillations of an ordinary pendulum? 


Solution 
Let the origin O be the suspension point of the pendulum as shown in Figure 17.5. 
The transformed equation of motion for the pendulum bob P is 


dv 
m| +200] =-—mgk+T, 


where T is the tension force exerted by the string. The magnitude T of the tension 
force is unknown but its direction is opposite to the position vector r so that 
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where a is the length of the string. The equation of motion for P can therefore be 
written 


dv T 
m| 2+ 29x0] = —mgk — (=) Tr. (17.29) 
dt a 


This non-linear equation, which holds for large oscillations of the pendulum, cannot 
be solved in closed form. We will therefore restrict attention to the case of small 
oscillations which allows us to replace (17.29) by a linear approximation. The zero 
order approximation corresponds to the ‘motion’ r = —ak, in which the pendulum 
hangs in equilibrium. Thus the zero order approximation to the tension is T = mg. 

Furthermore, if we write r = xi+ yj +k, then, in any motion of the pendulum, 
x? + y? +2? = a’, from which it follows that 


Pee me ee ue 
( ( y?)) 


2 2 
x+y 
= 1 wee dy 


which shows that the change in z is second order in the small quantities x and y. It can 
similarly be shown that v, and dv, /dt are also second order quantities. On using these 
approximations in equation (17.29), linearised equation for the horizontal motion 
of P is found to be 


dul H 2H 
re y+tnr* =0, (17.30) 
where r? = xi+yj,v" =vyit vy J and n> = g/a. Note that n is the angular fre- 


quency the pendulum pendulum would have if the Earth were not rotating. Equation 
(17.30) is the linearised equation for the motion of the point P’ shown in Figure 17.5. 

The two-dimensional vector equation (17.30) is equivalent to a pair of homoge- 
neous second order linear equations with constant coefficients and can therefore be 
solved by standard methods. However, this is a messy job and there is a much sim- 
pler way of generating the solutions. The term 2Q cos (kx v") can be eliminated by 
viewing the motion from a frame* OXY Z rotating with angular velocity —(Q cos B)k 
relative to the frame F (see Figure 17.5). We will call this new frame the precessing 
frame and the horizontal displacement, velocity and acceleration of P in this frame 
will be denoted by R”, V and A”. The standard transformation formulae then give 


v? =V"4 (-QcosBk)xR, a” = A” +(-QceosBk)xV (17.31) 


and if these formulae are substituted into equation (17.30), we find that the linearised 
equation of motion for P in the precessing frame can be written in the form 
d*R" 
dt? 


4 (0? _~ 9? cos” B) R! =0, (17.32) 


* Note that this operation does not simply return us to the underlying inertial frame. 
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which is the two-dimensional SHM equation. Its solutions are known to be ellipses 
with centre O and angular frequency (n? — Q? cos” £)!/?. The difference between 
this frequency and n is completely negligible. 


Conclusion Jn small oscillations, the period of the pendulum is unchanged by the 
Earth’s rotation, but its motion now precesses* around the apparent vertical with 
angular velocity —(Qcos B)k. In particular, an oscillation that would take place 
in a fixed vertical plane through O when Q = 0 will now take place in a verti- 
cal plane through O that precesses slowly around the axis Oz with angular velocity 
—(Qcos B)k. This precession is in the negative sense in the northern hemisphere, 
and in the positive sense in the southern hemisphere. @ 


This practical demonstration of the Earth’s rotation is called Foucault’s pendulum‘ 
and is a favourite exhibit in science museums the world over. [But not in Singapore. 
Why?] From the duration of your museum visit, and the angle turned by the plane of 
motion of the pendulum while you were inside, you can work out your latitude! 


Winds, ocean currents and bathwater 


Atmospheric winds and ocean currents are often strongly influenced by the Coriolis force 
induced by the Earth’s rotation. We may analyse these fluid motions by analogy with 
particle dynamics. 


By analogy with equation (17.23) for the motion of a particle over the Earth, the equation of motion 
for the steady flow of the atmosphere or the ocean is 


pat+2pQxv =—pgk — grad p. (17.33) 


Here, v(r) and a(r) are the velocity and acceleration of the fluid particle with position vector r, p is the 
fluid density and p is the pressure field in the fluid. The term — grad p is called the pressure gradient and 
represents the force exerted on a fluid particle by the fluid around it. The acceleration a can be written in 
terms of v. If we follow the progress in time of a fluid particle, it follows from the chain rule that 
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Let us now compare the sizes of the two acceleration terms on the left of equation (17.33). Suppose that 
the flow has characteristic length scale L and velocity scale V. Then the Coriolis acceleration 2 & x v is of 
order QV and, from equation (17.34), the acceleration a is of order v2 /L. The dimensionless ratio of these 
two magnitudes, given by 


Ro=—, (17.35) 
QL 


is called the Rossby number* of the flow. When the Rossby number is small, the Coriolis acceleration 
dominates over the real acceleration. When this condition is satisfied, the equation of motion (17.33) can 
be approximated by 


20Q2xv = —pgk — grad p. (17.36) 


* A precession is a slow rotation superimposed on a fast motion. 

+ After Jean Bernard Leon Foucault (1819-1 868), who first exhibited it at the 1851 Paris Exhibition. This 
was the first time the Earth’s rotation had been demonstrated by an entirely terrestrial method. 

* After the Swedish-American meteorologist Carl-Gustav Rossby (1898-1957). 
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which is called a geostrophic flow. If the flow velocity v is everywhere horizontal, then, on taking the 
scalar product of this equation with v, we obtain 


v-+ grad p = 0. (17.37) 


Hence, in a horizontal geostrophic flow, the fluid velocity is perpendicular to the horizontal pressure gradi- 
ent. In other words, the fluid flows along the isobars. 


Weather maps often show winds circulating around areas of low pressure. In a large low 
pressure system on the Earth, V ~ 10ms~!, L ~ 1000km and Q ~ 107‘ radians 
per second. The corresponding Rossby number is Ro ~ 0.1. Such a flow is therefore 
quite accurately geostrophic. We would therefore expect the wind to follow the isobars 
in the sense predicted by equation (17.36) (anti-clockwise around a ‘low’ in the northern 
hemisphere) and qualitatively this is what is observed. On the other hand, in the wind flow 
near the centre of a hurricane, V ~ 50 ms~! and L ~ 50 km so that the Rossby number 
Ro © 10. Hence, contrary to common belief, the Coriolis force has an insignificant* effect 
on the wind in this part of a hurricane. Even less significant is the effect of the Earth’s 
rotation on the swirling motion of water leaving a bath or a toilet bowl. The direction of 
the water in such flows is strongly influenced by plumbing but not by Coriolis force! 


17.4 MULTI-PARTICLE SYSTEM IN A NON-INERTIAL FRAME 


Some of the principles of multi-particle mechanics can be extended to non- 
inertial reference frames. 


Linear momentum principle 
The linear momentum principle (see Chapter 10) can easily be extended to the case in 
which the system is observed from a non-inertial frame. 


In the notation for systems of particles used in Part II, the transformed equation of motion for the 
particle P; of mass m; is 


N 
. 1 . 
m; [A+ @xr) 20x; + Q2x(Qxrj) + al =F; +> Gij 
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(1 <i < N). Here, all quantities are observed in the moving frame F ’ the dashes being understood 
throughout. On summing these equations over all the particles we obtain 
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* And yet the direction of such winds is always that predicted by the geostrophic theory! However, the 
flow was geostrophic in the early stages of the hurricane’s formation, and this presumably determines the 
direction of flow thereafter. 
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As in the standard linear momentum principle, the double sum of the {G;;} is zero. In terms of centre of 
mass variables, this equation can be written 


é dV 
M [A+ Ox R429xV 4 9x(@xR) + | =F, (17.38) 


where R and V are the position and velocity of the centre of mass, M is the total mass, and F is the total 


external force. This is the centre of mass form of the linear momentum principle in a non-inertial frame. 


We have therefore proved that: 


Linear momentum principle in a non-inertial frame 


In any reference frame, the centre of mass of a system moves as if it were a particle 
of mass the total mass, and all the external forces acted upon it. 


Hence, even in a non-inertial frame, the motion of the centre of mass of a system can 
still be calculated by particle mechanics. 


Energy principle 
There is also a form of energy conservation that applies in frames with a fixed origin and 
constant angular velocity. The proof is as follows: 


In this case the acceleration A and the angular acceleration Q are zero and the transformed equation 
of motion for the particle P; becomes 


N 
1 7 
mj; [2@x», + Q2x(Qxrj) + al =F; +)>° Gij, 
j=l 


in the standard notation. If we now take the scalar poduct of this equation with v;, the Coriolis term vanishes 
and we obtain 


N 

dv; 
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(1 <i < N). This is the same equation as for an inertial frame except for the centrifugal term. This term 

can be written 
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where p; is the perpendicular distance of P from the rotation axis. On substituting this formula into 
equation (17.39) and summing over all the particles we obtain 
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FIGURE 17.6 The straight rod rotates with Q 
constant angular speed Q around O and the 
square lamina slides along the rod. O 
that is 
d N N WN 
= (T= 51a) = OF; + > Givi, (17.40) 
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where T is the apparent total kinetic energy and J, is the moment of inertia of the system about the 
rotation axis A. The right side of equation (17.40) is the apparent total rate of working of all the forces. If 
the apparent total work done can be represented by the potential energy V, we will say that the system is 
apparently conservative. In this case, we have conservation of energy in the form 


where E is a constant. The term — Al AQ? is sometimes called the centrifugal potential energy and we 
will call this equation the transformed energy equation. Our result is summarised as follows: 


Energy conservation in a uniformly rotating frame 


If a system is apparently conservative when viewed from a rotating reference frame 
with a fixed origin and constant angular velocity, energy conservation holds in the 
modified form 


T+V —j14W=E, (17.41) 
where T and V are the apparent kinetic and potential energies, Q is the angular 


speed of the reference frame, and /.4 is the moment of inertia of the system about 
the rotation axis. 


For systems with one degree of freedom, this energy conservation principle is suffi- 
cient to determine the motion, as in the following example. 


Example 17.5 


A long straight smooth rod lies on a smooth horizontal table. One of its ends is fixed 
at point O on the table and the rod is made to rotate around O with constant angular 
speed Q2. A uniform square lamina of side 2a can slide freely on the table. One of 
its sides is in contact with the rod at all times and the lamina slides freely along the 
rod. Initially, the lamina is at rest (relative to the rod) with one corner at O. Find the 
subsequent displacement of the lamina as a function of the time. 
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Solution 
We will view the motion of the lamina from a reference frame with origin O in which 
the rod is at rest. This frame has a fixed origin and constant angular velocity. 

First we find the apparent kinetic energy. Let r be the distance OC shown in 
Figure 17.6. In the rotating frame, the motion of the lamina is rectilinear and its 
apparent kinetic energy is therefore given by T = 5M r, 

Next we find the apparent potential energy. There are no specified external forces, 
but there is the constraint force exerted on the lamina by the rod, which, since the rod 
is smooth, acts perpendicular to the rod. In an inertial frame, this constraint force 
does work, but, in the frame rotating with the rod, the lamina simply moves along 
the stationary rod. The apparent rate of working of the constraint force is there- 
fore zero. The internal forces within the lamina enforce rigidity and do no appar- 
ent work in total. The system is therefore apparently conservative with potential 
energy V = 0. 

The transformed energy equation is therefore 


2 2 
5M? — S1,4Q’? =E, 


where J, is the moment of inertia of the lamina about the vertical axis through O, and 
Q is the angular speed of the rod. By the ‘parallel axes’ theorem (see the Appendix 
at the end of the book), /\4 is given by 


TA =Igt (a? +77), 
so that the equation becomes 
5Mr? — } (Ic +a? +r) Pas. 
The initial conditions 7 = a and7 = 0 whent = 0 give 
E=-} (1c 4. 2a”) 
so that the transformed energy equation for the lamina is finally given by 
i = 9? - = a’) : 


This equation is sufficient to determine the motion. On taking square roots and inte- 
grating, we find that the displacement of the lamina at time f is 


r =acosh Qt, 


which is the same answer as for a particle sliding along a rotating wire. & 


17.4 Problems 489 


Problems on Chapter 17 


Answers and comments are at the end of the book. 


Harder problems carry a star (*). 


Kinematics in rotating frames 


17.1 Use the velocity and acceleration transformation formulae to derive the standard expres- 
sions for the velocity and acceleration of a particle in plane polar coordinates. 


17.2 Addition of angular velocities Prove the ‘addition of angular velocities’ theorem, The- 
orem 17.13. [Hint. For a general particle of the rigid body, find v and v’ in terms of » and o’ 
respectively. Then relate v and v’ by the velocity transformation formula. ] 


17.3 A circular cone with semi-angle q@ is fixed with its axis of symmetry vertical and its 
vertex O upwards. A second circular cone has semi-vertical angle (7/2) — a and has its vertex 
fixed at O. The second cone rolls on the first cone so that its axis of symmetry precesses 
around the upward vertical with angular speed A. Find the angular speed of the rolling cone. 


Dynamics in rotating frames 


17.4 A particle P of mass m can slide along a smooth rigid straight wire. The wire has one 
of its points fixed at the origin O, and is made to rotate in a plane through O with constant 
angular speed (2. Show that r, the distance of P from O, satisfies the equation 


F-Q?r =0. 


Initially, P is at rest (relative to the wire) at a distance a from O. Find r as a function of ¢ in 
the subsequent motion. 


17.5 Larmor precession A particle of mass m and charge e moves in the force field F(r) 
and the uniform magnetic field Bk, where k is a constant unit vector. Its equation of motion is 


then 
dv eB 
m— =| — )vxk+ F(r) 
dt c 


in cgs Gaussian units. Show that the term (eB/c)v x k can be removed from the equation by 
viewing the motion from an appropriate rotating frame. 

For the special case in which F(r) = —may, r, show that circular motions with two 
different frequencies are possible. 


17.6 A bullet is fired vertically upwards with speed u from a point on the Earth with 
co-latitude 6. Show that it returns to the ground west of the firing point by a distance 
4Qu} sin B/3g?. 


17.7 An artillery shell is fired from a point on the Earth with co-latitude 6. The direction of 
firing is due south, the muzzle speed of the shell is uv and the angle of elevation of the barrel is 
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a. show that the effect of the Earth’s rotation is to deflect the shell to the east by a distance 


4Qu3 


392 sin? a (3 cosa cos B + sina sin B). 
& 


17.8 An artillery shell is fired from a point on the Earth with co-latitude 6. The direction of 
firing is due east, the muzzle speed of the shell is u and the angle of elevation of the barrel is 
a. Show that the effect of the Earth’s rotation is to deflect the shell to the south by a distance 


4Qu3 


7 sin? @ cos a cos B. 
3g 


* Show also that the easterly range is increased by 


4Qu3 


392 


sina sin B (3 — 4sin’ 9) : 
[Hint. The second part requires a corrected value for the flight time.] 


Energy conservation in rotating frames 


17.9 Consider Problem 17.4 again. This time find the motion of the particle by using the 
transformed energy equation. 


17.10 One end of a straight rod is fixed at a point O on a smooth horizontal table and the rod 
is made to rotate around O with constant angular speed Q. A uniform circular disk of radius 
a lies flat on the table and can slide freely upon it. The disc remains in contact with the rod 
at all times and is constrained to ro// along the rod. Initially, the disk is at rest (relative to the 
rod) with its point of contact at a distance a from O. Find the displacement of the disk as a 
function of the time. 


17.11 A horizontal turntable is made to rotate about a fixed vertical axis with constant angular 
speed Q. A hollow uniform circular cylinder of mass M and radius a can roll on the turntable. 
Initially the cylinder is at rest (relative to the turntable), with its centre of mass on the rotation 
axis, when it is slightly disturbed. Find the speed of the cylinder when it has rolled a distance 
x on the turntable. 

* Find also an expression (in terms of x) for the force that the turntable exerts on the 
cylinder. 


Hydrostatics in rotating frames 


When a fluid is at rest in an inertial frame, the equation of hydrostatics is 
0 = F — grad p, 


where p is the pressure field in the fluid, and F is the body force (force per unit volume) acting on the 
fluid. This equation means that the body force is balanced by the pressure gradient. In the case of uniform 
gravity, F = —pgk and, if pe is constant, the equation integrates to give the well known ‘hydrostatic pressure 
formula’ p = po — pgz, where Oz points vertically upwards. 

If a fluid is at rest in a rotating frame with a fixed origin and constant angular velocity @, then the 
equation of ‘hydrostatics’ in this frame is 


pQ2x(Qxr) = F — grad p, 
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FIGURE 17.7 A mass of liquid in equilibrium (left) and rotating like a rigid body (centre and 
right, cut in half to show the cross sections). The right figure shows the mass rotating with the 
critical angular speed. 


where p is the fluid density. (This equation means that the body force, the pressure gradient and the 
centrifugal ‘force’ must balance.) This is all you need to know to answer the following questions. 


17.12 Newton’s bucket A bucket half full of water is made to rotate with angular speed Q 
about its axis of symmetry, which is vertical. Find, to within a constant, the pressure field in 
the fluid. By considering the isobars (surfaces of constant pressure) of this pressure field, find 
the shape of the free surface of the water. 

What would the shape of the free surface be if the bucket were replaced by a cubical box? 


17.13 A sealed circular can of radius a is three-quarters full of water of density p, the remain- 
der being air at pressure po. The can is taken into gravity free space and then rotated about its 
axis of symmetry with constant angular speed 2. Where will the water be when it comes to 
rest relative to the can? Find the water pressure at the wall of the can. 


Computer assisted problem 
The following problem is suitable for a supervised project. 


17.14%% Ifa mass of liquid in gravity free space is in equilibrium, then it has the form of 
a sphere, stabilised by its own surface tension (see Figure 17.7 (left)). Suppose that the mass 
now rotates like a rigid body with constant angular velocity. What will its equilibrium shape 
be now? Show that, as the angular speed increases, the mass becomes more oblate until its 
top and bottom surfaces become quite flat (Figure 17.7 (centre)). Show further that, at higher 
angular speeds, the top and bottom surfaces approach each other until they meet (Figure 17.7 
(right)) and the mass breaks up. Find this critical angular speed for a sphere of mercury of 
radius 5 cm. 


Chapter Eighteen 
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Tensor algebra 
and the inertia tensor 


KEY FEATURES 

The key features of this chapter are the transformation formulae for the components of ten- 
sors and tensor algebra; the inertia tensor and the calculation of the angular momentum and 
kinetic energy of a rigid body; and the principal axes and principal moments of inertia of a 
rigid body. 


We have previously regarded a vector as a quantity that has magnitude and direction. 
Picturing a vector as a line segment with an arrow on it has helped us to understand essen- 
tially difficult concepts, such as the acceleration of a particle and the angular momentum 
of a rigid body. Neither of these quantities has any direct connection with line segments, 
but the picture is a valuable aid nonetheless. Useful though this notion of vectors is, it is 
not capable of generalisation and this is the main reason why we will now look at vectors 
from a different perspective. In reality, what we actually observe are the three components 
of a vector, and the values of these components will depend on the coordinate system in 
which they are measured. However, a general triple of real numbers {v1, v2, v3}, defined 
in each coordinate system, does not neccessarily constitute a vector. The reason is that the 
components of a vector in different coordinate systems are related to each other (in a way 
that we shall determine) and, unless the quantities {v;, v2, v3} satisfy this transformation 
formula, they are not the components of a vector. This leads us to a new definition of a 
vector as a quantity that has three components in each coordinate system, whose values in 
different coordinate systems are related by the vector transformation formula. 

Defined in the above way, vectors can be regarded as part of a hierarchy of entities 
called tensors with a tensor of order n having 3” components. A tensor of order zero, 
which has one component, is a scalar, and a tensor of order one, which has three com- 
ponents, is a vector. Tensors of higher order, which are entirely new objects, are defined 
to be quantities with 3” components that obey the appropriate transformation formula. 
In this chapter we give a gentle introduction to the principles of tensor algebra, which 
should be useful, not only for its present application to dynamics, but also in other areas 
such as relativity. Any account of tensor algebra is mathematics, not mechanics, and some 
readers may find this indigestible. However, there are very few basic principles involved 
and, once these are mastered, the subject seems (and actually is) rather easy. 
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FIGURE 18.1 The Cartesian coordinate systems C = Ox ,x2x3 and 

C’ = Ox}x5x4 have the associated unit vectors {e;, e2, e3} and 

{e',, e5, e4} respectively. The two systems actually have a common origin 
but are drawn separately for clarity. 


We then introduce the inertia tensor, a second order tensor that is used to calculate the 
angular momentum L and kinetic energy T of a generally rotating rigid body. It transpires 
that, by taking a special choice of axes, known as principal axes, the formulae for L and 
T are greatly simplified and reduce to non-tensorial forms. This makes it possible to solve 
most problems in rigid body dynamics without any knowledge of tensors at all.* The few 
results that one really needs in rigid body dynamics are given at the end of Section 18.6. 


18.1 ORTHOGONAL TRANSFORMATIONS 


Let C = Ox,x2x3 andC’ = OX 55%5 be two Cartesian coordinate systems having 
the common origin O and associated unit vectors {e;, e2, e3} and {e’. €,, e3} respectively 
(see Figure 18.1). Then any vector v (understood in the elementary sense of Chapter 1) 
can be expanded in the form 


v =v, e) + 2e2 + 33, (18.1) 


where {v1, v2, v3} are the components of v in the coordinate system C. Similarly v can be 
expanded in the form 


v= vie, +765 + v4¥es, (18.2) 


where {v}, v5, v3} are the components of v in the coordinate system C’. These two sets of 
components are related to each other. On taking the scalar product of the equality 


ve) + uses + uses = vier + v2e2 + 363 (18.3) 


* Don’t spread this information around! 
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successively with e’, e’, and e’, we find that 

1 = (21 + e1) vr + (e+ €2) v2 + (e+ 

vs = (e+ e1) v1 + (e+ e2) v2 + (€5 - e3) 03, 
3 = (€5 +1) v1 + (€5 + e2) v2 + (65 


which can be written in the matrix form 


v=A-v (18.4) 
where 
v} v1 (e, - e1) (e} + e2) (e) + €3) 
v= a: v=|w]. A= | (e+ e1) (eh 2) (e+ e3) |}, (18.5) 
Us v3 (e+ e1) (5 + e2) (€5 + e3) 


and the *-’ means the matrix product. This is the required relationship between the 
components of v in the two coordinate systems. The elements of the transformation 
matrix A depend only on the orientation of the system C’ relative to the system C. 
The inverse relationship to (18.4) can be found by taking the scalar product of equa- 
tion (18.3) successively with e1, e2 and e3. The result is that 


iss) 


Uh = (ere) 1+ (61-65) va le 
Uh = (€2-¢)) 01+ (2-64) ort (er: 
Ws = (en-)) 01+ (eae) va +(e 


—" 


i) 
wr Wr Wr 
— 
cS = ko 
Ww Wo Ww 


—” 


iss) 


which can be written in the matrix form 


v=A’.v (18.6) 


where A’ is the transpose of A. If we now substitute equation (18.4) into equation (18.6) 
we obtain 


v=A™-(A-v)=(A"-A)-v. 


Since this formula holds for every choice of v, it follows that A! .A =1, where 1 is the 
identity matrix. This means that the transformation matrix A has the special property that 
A’ = A“!. Such matrices are called orthogonal. 


Definition 18.1 Orthogonal matrix A matrix having the property that its transpose is 
also its inverse, that is, 


A’ .A=A.-A! =1, (18.7) 


is called an orthogonal matrix. 
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Thus the transformation matrix A, defined by (18.5), is an orthogonal matrix. For this 
reason, the transformation formula (18.4) is called an orthogonal transformation. 


Example 18.1 Prove a matrix is orthogonal 


Prove that the matrix 


, (4-1-4 
=, 1 4 8 
8-4 1 
is orthogonal. 
Solution 
1 4 1 8 4 7-4 1 81 0 O 
Bi -B=— 7 44 1 4 8 =e 081 O}] =1. 
—-4 8 1 8-4 1 0 O81 


There is no need to check that B - B’ = 1 as this now follows automatically. 


18.2 ROTATED AND REFLECTED COORDINATE SYSTEMS 


In this section we work out the transformation matrices that correspond to specific 
rotations and/or reflections of the coordinate system C. Important though this is, it is not 
needed for the general understanding of tensors. 

Suppose now that the coordinate system C’ is defined by a specified rotation of C 
about an axis through O. This notion needs a little thought. It should be remembered that 
the coordinate system C does not actually move; the ‘rotation’ of C is hypothetical and 
is merely a way of describing the orientation of C’ relative to C. What we are saying is 
that the orientation of C’ is the same as C would have if it were rotated in the prescribed 
manner. 

Consider first the special case in which C’ is obtained by rotating C through an angle 
w about the axis O.x3, as shown in Figure 18.2 (Left). In this case, 


14 / * 2 
e+ e1 =cosy e,:e2=sinw e,:e3=0 
ifs . / i 
e,-er =—sinw @,:e2=cosyp e,-e3=0 
e,°e,=0 é,+ @2 = 0 é,-e3;=1 


and the transformation matrix between C and C’ is therefore 


cosy siny 0 
A=] —sinwcosw 0]. (18.8) 
0 0 1 


Since this example is a special case of the preceding theory, the matrix A in (18.8) must 
be orthogonal. [You should check this directly.] A rotation of C about the Ox; or Ox2 
axis is treated in the same way. 
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FIGURE 18.2 Left: The coordinate system C’ = Ox}x‘.x‘ is obtained by rotating the 
coordinate system C = Ox ,x2x3 through an angle y about the axis O.x3. Right: The 
unit vector e’ is obtained by rotating the unit vector e through an angle y about the 
general axis {O, n}. 


Example 18.2 Rotation of C about a coordinate axis 


The coordinate system C’ is obtained by rotating the coordinate system C through an 
angle of 30° about the axis Ox. Find the transformation matrix A between C and C’. 
A vector v has components {3, —1, 2} inC. What are its components in C’? 


Solution 
The formula corresponding to (18.8) when C is rotated through an angle y about the 
axis Ox, 1s 
1 0 0 
A=]|0 cosy sinw 
0 —sinw cosy 


Hence, when w = 30°, the transformation matrix is 


1 0 0 
A=|0 V3/2 1/2 
0 -1/2 /3/2 


The components of the vector v in C’ are then given by the elements of the column 
vector Vv’, where 


3 1 0 0 3 3 
/=A.[-4]=|[0 73/2 1/2] [-4] =] 1-2v3 
) 0 =1/72 4/3/72 2 24+ 73 


The components of the vector v in C’ are therefore {3, 1 — 2/3 , 2+ V3 }. a 


In the general case, the rotation of C can take place about an axis {O, n}, where n is 
any unit vector. The argument is messier but we can still find an explicit formula for A. 


18.2 Rotated and reflected coordinate systems 497 


Figure 18.2 (Right) shows how a typical unit vector e’ of the coordinate system C’ is obtained from the 
corresponding vector e belonging to C. Let e be resolved into parts parallel and perpendicular to the axis 
{O, n} in the form 


e= el + et, 
where 
— L — 
e! =OM= (e-n)n, e- =ME=e—(e-n)n. 
Then, if e’ is similarly resolved as 
e’ _ Fal 4 et, 


it follows that e’!! = ell and 


eft = cos we! + sin y (nxet) = cos wv (e— (e-n)n)+siny (nxe). 


Hence e’ is related to e by the formula 
e’ =coswe+(1—cosy)(e-n)n+siny (nxe). (18.9) 


If we now take e = e], e’ = e and take the scalar product of equation (18.9) successively with e;, e2 and 
e3, we obtain 
e -e; = cosw(e;-e;)+(U —cosw)(n- e;)(n+ ey) + sinw(nxe1) +e} 
= cosw + (1 — cos wnt +0, 

e} eo = cosw(e, -e2) + (1 —cosw)(n- e1)(n+ en) + sinw(nxe,) + eo 

= 0+ —cosw)njn2 +73 siny, 
e -e3 = cosw(e, -e3) + (1 —cosw)(n- e1)(n- e3) + sinw(nxe,) + e3 
0+ (1—cosw)nyn3 —n2siny, 


where 71, 12, 3 are the components of the vector n in the coordinate system C. This gives the elements of 
the first row of the transformation matrix A, and the elements of the second and third rows can be found in 
a similar manner. The full transformation matrix A is given by 


cos w + (1 — cos wnt (l—cosy)nayn2+n3sinw (1 —cosw)nyn3 — nz sinw 

(1 —cosy)nogn; —n3siny cosw-+ (1—cos wns (1 —cos wW)non3 +n, siny (18.10) 

(1 —cosy)n3ny +nzsiny (1—cosw)n3n2 —n,sinw cosw-+ (1 —cos wns 
This is the transformation matrix A when C’ is obtained by rotating C through an angle 
w about the axis {O, m}; 11, n2, n3 are components of the unit vector n in the coordinate 
system C. 


Example 18.3 Rotation of C about a general axis 


The coordinate system C’ is obtained by rotating the coordinate system C through an 


— 
angle of 60° about the axis OP, where P is the point with coordinates (1, 1, —1) in 
C. Find the transformation matrix A between C and C’. 

A vector v has components {3, —6, 9} in C’. What are its components in C? 


Solution 


The unit vector n in the direction O P has components {1//3, 1/3, —1//3} in C, 
and w = 60°. It follows from the formula (18.10) that the tranformation matrix from 
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C toC’ is 
2-1-2 
A=-|2 2 1 
1-2 2 


The components of the vector v in C are then given by the elements of the column 
vector v’, where 


3 ‘ 2.2: 3 1 
v=A’. 6 =5 1 2-2 6|/= 11 
9 —3 7.2 9 2 


The components of the vector v in C are therefore {1, —11, 2}. @ 


Reflections 
The coordinate system C’ may also be defined by reflecting C in a plane through O. For 
example, if C is reflected in the coordinate plane Ox,x2, the transformation matrix is 


easily found to be 


10 0 
A=|0 1 0 
0 0-1 


In the general case, if C is reflected in the plane through O with unit normal vector n, then 
the transformation matrix is 


1- 2nt —2nin2 —2n\n3 
A= | —2n2n, 1—2n} —2non3 |, (18.11) 
—2n3n, —2n3n2 1|— 2n% 


where 711, 72, N3 are the components of the vector n in C. [The method of proof is similar 
to that for the general rotation. ] 


Example 18.4 Reflection of axes 


The coordinate system C’ is obtained by reflecting the coordinate system C in the 
plane x3 = 2x; + 2x2. Find the transformation matrix A between C and C’. 

Find also the transformation matrix when C’ is obtained from C by performing 
the above reflection followed by a rotation of 90° about the new x3-axis. 


Solution 
The equation x3 = 2x; + 2x2 can be written in the form 


(x11 + x2e2 + x3€3) + (2e; + 2e2 — €3) 


and, by comparison with the standard vector equation r -n = p of a plane, we see 
that the unit vector 
2 2 1 


ha et 302 3° 
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is normal to the reflection plane. Hence nj = 2/3, no = 2/3 and n3 = —1/3. It 
follows from the formula (18.11) that the transformation matrix between C and C’ is 


(1-8 4 
A=5[-8 1 4 
4 47 


If this transformation is followed by a rotation whose transformation matrix is 
010 
-1 00 
00 1 


then the overall transformation has matrix 


0 10 1-8 4 1 -8 1 4 
~|-1 0 0 -8 1 4 = —-1 8-4]. 
00 1 4 4 7 44 7 


It can be proved that the most general orthogonal transformation corresponds either 
to (i) a rotation of coordinates, or (11) a reflection followed by a rotation. Which of these 
classes a particular orthogonal transformation belongs to can be decided by examining the 
sign of the determinant det A. 

If we take determinants of equation (18.7) we obtain 


det = det (a -A) = det A? x det A = det A x detA = (det A)”, 


from which it follows that det A can only take the values +1. Surprisingly, we can work 
out the determinant of the general A defined by equation (18.5) quite easily. The first row 
of A contains the three components of the vector e’, in the coordinate system C. Likewise, 
the second and third rows contain the components of the vectors e, and e4. It follows from 
the vector formula (1.7) that, provided C is a right-handed coordinate system, 


det A = (e) xe) - e3. 


We can now see how the different signs for det A can occur. If C’ is also a right-handed 

coordinate system, then the triple scalar product (e' x e) -e = +1, whereas if C’ is a 
left-handed system then (e/ xe’) - e; = —1. If two coordinate systems have the same 
handedness, then one can be made coincident with the other by a suitable rotation about 
an axis through O. If they have opposite handedness then a reflection in a plane through 
O is also needed. For this reason, orthogonal matrices with determinant +1 are sometimes 
called rotation matrices. 


18.3 SCALARS, VECTORS AND TENSORS 


Our view of a vector as a quantity that has magnitude and direction has served 
us well. By picturing a vector as a line segment with an arrow on it, we have been able 


500 Chapter 18 Tensor algebra and the inertia tensor 


to understand essentially difficult concepts, such as acceleration and angular momentum, 
far more easily. Neither of these quantities has any direct connection with line segments, 
but the picture is a valuable aid nonetheless. However, this notion of vectors cannot be 
generalised to tensors. This is the main reason we will now look at vectors (and even 
scalars) in a different light. 


Scalars 
We begin by giving the true definition of a scalar. 
Definition 18.2 Scalar Let ¢ be a real number defined in each coordinate system.* 


Then @ is said to be a sealar if it has the same value in every coordinate system. That is, 
for any pair of coordinate systems C and C’, 


Definition of a scalar 


o =¢. 


(18.12) 


This can be alternatively expressed by saying that a scalar is invariant under change of 
coordinate system. 


It is this invariance under change of coordinate system that is the essential feature of 
a scalar. Thus the mass of a particle and the length of a line are scalars. However, the 
sum of the coordinates of a given point of space (which is a single real quantity defined in 
each coordinate system) is not a scalar since it is nof invariant under change of coordinate 
system. 


Vectors 


We turn now to the definition of a vector in terms of its components. The individual 
components of a vector are certainly not invariants. They transform according to the 
transformation formula (18.4). It is this transformation formula that becomes our new 
definition of a vector. 


Definition 18.3 Vector Let {v1, v2, v3} be a set of three real numbers defined in each 
coordinate system. Then {v,, v2, v3} are said to be the components of a vector if their val- 
ues in any pair of coordinate systems C and C' are related by the transformation formula 


Definition of a vector (matrix form) 
(18.13) 
v=A-v 


where A is the transformation matrix between C and C’'. 


* These are rectangular, Cartesian coordinate systems with common origin O. 


18.3 Scalars, vectors and tensors 501 


This definition of a vector is clearly consistent with our previous point of view. Note 
that a set of three real numbers defined in each coordinate system does not in gen- 
eral constitute a vector; the components must be related to each other by the transfor- 
mation formula (18.13). For example, the Cartesian coordinates (b;, b2, b3) of a par- 
ticle B are a set of three real numbers defined in each coordinate system and these 
numbers are related by the transformation formula (18.13). Thus {b;, b2, 53} is a 
vector. Suppose now that B is a moving particle with time dependent coordinates 
(b1(t), ba(t), b3(t)). Then, at each time t, {51 (t), b(t), b3(t)} is also a set of three of 
real numbers defined in each coordinate system. The transformation formula (18.13) 
is again satisfied since the matrix A does not depend upon the time. Hence, at each 
time ¢, {b1(t), b2(t), 63(t)} is a vector. This is, of course, the velocity vector of the 
particle B at time f, and a second differentiation yields the acceleration vector of B. 
In contrast, {2b,, b2, b3} is a set of three real numbers defined in each coordinate sys- 
tem, but the transformation formula is not satisfied. Therefore {2b;, b2, b3} is not a 
vector. 


Tensors 


In order to generalise our definition of a vector, it is necessary to write the vector transfor- 
mation formula (18.13) in the following suffix form: 


Definition of a vector (suffix form) 
2 (18.14) 


UE y aij vj 


j=l 


(1 < i < 3), where apg is the element in the p-th row and q-th column of the 
matrix A. The matrix product in equation (18.13) is equivalent to the summation in 
equation (18.14). 

Defined in the above way, vectors can be regarded as part of a hierarchy of entities 
called tensors with a tensor of order n having 3” components. A tensor of order one, 
which has three components, is a vector; a tensor of order zero, which has one compo- 
nent, is a scalar. The general definition of tensors looks extremely daunting when first 
encountered. Below we give the definitions of second and third order tensors, from which 
the general case can be inferred. 


Definition 18.4 Second order tensor Let {t;j} (1 < i,j < 3) be a set of nine real 
numbers defined in each coordinate system. Then the {t;;} are said to be the components 
of a second order tensor if their values in any pair of coordinate systems C and C’' are 
related by the transformation formula 
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Definition of a second order tensor (suffix form) 


3. 3 
P 
i = Dy Dain aii te 


k=1 /=1 


(18.15) 


(1 <i, j < 3), where the {apq} are the elements of the transformation matrix between C 
and C’. 


Definition 18.5 Third order tensor Let {t;jx} (1 < i, j,k < 3) be a set of twenty 
seven real numbers defined in each coordinate system. Then the {t;jx} are said to be the 
components of a third order tensor if their values in any pair of coordinate systems C and 
C’ are related by the transformation formula 


Definition of a third order tensor (suffix form) 


3 3 3 (18.16) 


/ 


/=1 m=1 n=1 


(1 <i, j,k < 3), where the {dapq} are the elements of the transformation matrix between 
C and C’. 


Notes on the tensor transformation formulae 


The transformation formulae (18.14), (18.15), and (18.16) may seem incomprehensible 
but they do follow a pattern. In the definition of a vector there is only one summation 
and one appearance of dp ,; in the definition of a tensor of the second order, there are two 
summations and two appearances of a,,, and so on. Correct positioning of the suffices is 
vital. The suffices of the tensor on the left (in order) must be the same as the first suffix of 
each of the apg (in order); and the suffices of the tensor on the right (in order) must be the 
same as the second suffix of each of the dp (in order).* 

Power users of tensors write formulae such as those above without the summation 
signs. They adopt the summation convention’ that any repeated suffix is deemed to 
be summed over its range. Although the summation convention is widely used in most 
applications of tensors, we do not use it here. This is for two reasons: (i) it is a good thing 
to see the summation signs written in when tensors are first encountered, and (ii) we have 
no need of the heavy tensor algebra for which the summation convention was designed. 


* Be a hero. Write out the transformation formula for a fourth order tensor. 
+ Invented by Einstein to simplify the writing of the theory of general relativity. 
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Second order tensors in matrix form 


In the general case it is not possible to give any simpler form for the tensor transformation 
formula. However, for the special case of tensors of the second order (which is what we 
are mainly concerned with), the transformation formula can be written in a more user- 
friendly form in terms of matrix products. 

In each coordinate system, a second order tensor {t;;} has nine components, indexed 
by the integers7, j (1 <i, j < 3). Itis natural then to display these components as a 3 x 3 
array, that is, as the elements of a 3 x 3 matrix. The tensor {f;;} can then be regarded as a 
3 x 3 matrix 


fy f12 13 
T= [| tar 22 #23 
131 132 133 


defined in each coordinate system. Also, the tensor transformation rule (18.15) can be 
wrtten in the form 


3 3 


3. 3 
- 
=>) > anajta = >~ So aik te ajl 


k=1 1 i=1 \k=1 
3/3 
T 
=D | diaieta J aij, 
I=1 \k=1 
T 


Pq 
expression are equivalent to matrix products and so the transformation rule for a second 


where the {a7} are the elements of the transposed matrix A’. The summations in this last 


order tensor can be expressed in the form 
T=A.T-A’. 


This gives the following alternative definition: 


Definition 18.6 Second order tensor (matrix form) Let T be a3 x 3 matrix defined 
in each coordinate system. Then T is said to represent a second order tensor if its values 
in any pair of coordinate systems C and C' are related by the transformation formula 


Definition of a second order tensor (matrix form) 
(18.17) 
T’=A.-T-A? 


where the A is the transformation matrix between C and C’. 


This definition has the virtue of being expressed in terms of the familiar operation of 
matrix mutiplication. It only applies to second order tensors but that is exactly what we 
need. The inertia tensor is second order, as are many other important tensors of physics. 
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Example 18.5 Transforming a second order tensor 


In the coordinate system C, a second order tensor has the components f}; = 1, t12 = 2, 
13 = 3, to) = 4, foo = 5, fo3 = 6, f3) = 7, f32 = 8, 133 = 9. Find the components of 
the tensor in C’ when 


(i) C’ is obtained from C by a rotation of 90° about the axis O.x3, and 


(ii) when C’ is obtained from C by a reflection in the plane x3 = 0. 


Solution 
In the coordinate system C, the matrix of components is given to be 


i 23 
T=[45 6 
789 


Part (i). From the formula (18.8), the transformation matrix of the required rotation 
of coordinates is 


0 1 0 
A=j|-1 0 0 
001 


The transformation formula (18.17) then implies that, in the coordinate systen C’, the 
tensor is represented by the matrix 


01 0 12 3 0 —-1 O 
T=A.-T-A’ =| -100]][4 5 6 1 00 
00 1 78 9 0 O01 
5-4 6 
=]—2 1-3 
8-7 9 
Hence the components of the tensor in C’ are t}, = 5, t}, = —4, t}; = 6,4, = —2, 
ty = ly hg 3, = 8p = — Tg = 9. 


Part (11). The transformation matrix of the required reflection of coordinates is 
10 0 
A={]0 1 0 
0 0-1 


The transformation formula (18.17) then implies that 
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Hence the components of the tensor in C’ are a —e i =-2, ts = —3, oi = 4, 
/ td / / / 
by = 5,1, = —6,14, = —7, 4, = —8,4,, = 9. 


The identity tensor 


The simplest second order tensor is the identity tensor. This tensor has the components 
{6;;} in every coordinate system, where 4;; is the Kronecker delta, defined by 


lt Sz) 
U0 G#A 


In each coordinate system, the identity tensor is represented by the identity matrix 1. The 
transformation formula (18.17) is certainly satisfied since 


A-1-A’=A-A’ =1, 


thus confirming that {4;;} is a tensor. 

The identity tensor one of a very restricted class of tensors that have the same com- 
ponents in all coordinate systems. Such tensors are called isotropic and they have an 
important physical rdle. All scalars are, by definition, isotropic; there are no (non-trival) 
isotropic vectors; and the only isotropic second order tensors are scalar multiples of the 
identity tensor. It becomes increasingly difficult to identify the isotropic tensors of higher 
orders! 


18.4 TENSOR ALGEBRA 


From the linearity of the transformation formulae, it follows that a scalar mul- 
tiple of a tensor is a tensor, and the sum of two tensors (of the same order) is a tensor. 
We will now look at two other ways in which new tensors can be created. This gives an 
indication how tensors can arise naturally. 


The outer product of two tensors 


Suppose, for example, that {w;;} is a second order tensor and {v;;;} a third order tensor. 
Then {tj jim} defined* by 


lijklim = UijVkim 


is a 243 component quantity, defined in each coordinate system. Although we will not 
prove this here, {tj jim} is a fifth order tensor by virtue of satisfying the appropriate 
transformation formula. This is an example of the outer product of two tensors. In 
general, the outer product of a tensor of order m and a tensor of order n is a new tensor 
of orderm +n. 


* To see what this means, write out the definitions of a few elements of {¢;;</m}; for example, 113231 = 
U43V231. Note that, in an outer product, all the suffix names must be different. 


506 Chapter 18 Tensor algebra and the inertia tensor 


The outer product is one way in which higher order tensors can be constructed from 
lower order ones. For example, suppose {u;} and {v;} are vectors (tensors of order one). 
Then the nine component quantity {u;v;} is a second order tensor whose matrix form is 


UV UjVZ UjVZ 
U2V{ U2V2 U2V3 
U3V{_ U3V2 U3V3 


In the same way, if {u;}, {v;} and {w;} are vectors, then the twenty seven component quan- 
tity {u;v;w,;} is a third order tensor. Not all higher order tensors can be so constructed, 
but it is acommon procedure. The inertia tensor makes use of this construction. 


Contraction of a tensor 


Suppose, for example, that {f;;,7} is a fourth order tensor. Now select two of its suffices 
(k and / say), set them equal (to m say)* and sum over the suffix m. The result is the nine 
component quantity 


3 
Wij = ) tijmms 


m=1 


defined in each coordinate system. Although we will not prove this here, {wj;;} is a second 
order tensor by virtue of satisfying the appropriate transformation formula. The tensor 
{w;;} is called the contraction of {¢;;,;} with respect to the suffix pair k, /. [ There are 
six different contractions of {¢;;4;}. What are they?] In the general case, contraction of a 
tensor produces a new tensor whose order is two less than that of the original tensor. 

Contraction of a second order tensor is the simplest case. If {t;;} is a second order 
tensor, then its contraction with respect to the suffix pair 7, 7, namely, 


3 
Se tii =t1+22 + 63, 


i=1 


must be a tensor of order zero, that is, a scalar invariant. If {t;;} is represented by the 
matrix T, then the contraction ft}; + f22 + f33 is the sum of the diagonal elements of T. 
In linear algebra this is called the trace of T. The fact that the trace of a second order 
tensor is invariant' is an important result. For example, suppose {u;} and {v;} are vectors. 
Then the outer product {u;v;} is a second order tensor. It now follows that the trace of 
this tensor, namely u4v1 + u2v2 + u430v3 1s an invariant. This is actually no surprise, since 
uv, + uU2v2 + u303 Is the scalar product of the vectors {u;} and {v;}, which is known to 
be independent of the coordinate system in which the components are measured. 


* The name of this repeated suffix can be any name not already in use. However, it is permissible to re-use 
either of the old suffix names that were set equal (k or / in this example). 
+ This result is by no means obvious. After all, the individual components of {t; j} are not invariants. 
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Another important application of contraction is as follows. Suppose that {f;} is a 
second order tensor and that {u;} is a vector. Then the outer product {t;;ux} is a third order 
tensor. If we now contract this third order tensor with respect to the suffix pair 7, k, we 
obtain the vector {w;} given by 


3 
wi= So tijuj. 
j=l 


This formula looks nice in the matrix formulation. If {t;;} is represented by the matrix T 
and {u;} by the column vector u, then {v;} 1s represented by the column vector V given by 


v=T-u. 


In other words, if the matrix T represents a second order tensor and the column vector 
U represents a vector, then the product T - U represents a vector. Thus pre-multiplicaton 
by T transforms one vector into another vector. This is a common way in which tensors 
act in physics. For example, in crystalline materials, the electric vectors E and D are not 
parallel. In each coordinate system, they are related by the formula 


D=K.-E, 


where the 3 x 3 matrix K represents the dielectric tensor, the crystalline equivalent of 
the dielectric constant. This tensor relationship between D and E is the cause of double 
refraction in crystalline materials. The inertia tensor has a similar rdle, transforming the 
angular velocity vector into the angular momentum vector. 


The angular velocity vector 


We have already shown that the velocity of a particle is a vector in the sense of the defi- 
nition (18.13), but we have so far said nothing about the transformation properties of the 
angular velocity of a rigid body. This is made more awkward to decide by the fact that w 
is defined indirectly. Suppose that one of the particles of a rigid body is fixed at the origin 
O. Then @ is essentially defined to be that ‘vector’ that gives the velocities of the particles 
of the body by the formula v = » xr. In each coordinate system this formula can also be 
expressed by the matrix product 


v=Q-xX, 
where 
VI 0 —-@3 @2 nal 
V=]u], Q= 3 0 —a, |, X=] Xo 
U3 —-02 | 0 X3 


[Check this.] Now {x;} and {v;} are known to transform as vectors, but nothing is known 
about the transformation properties of the matrix Q. However, it can be proved that if 
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a matrix maps vectors into vectors by matrix multiplication, then that matrix represents 
a second order tensor. Hence Q transforms as a second order tensor. This must, in turn, 
imply a rule for transforming the {@;}. It looks as if this rule will be something horrendous, 
but, amazingly, it turns out that the rule is 


QW) @) 
wi, | = (det A) A- | o> |, (18.18) 
ws W3 


which is not quite the transformation formula for a vector because of the factor det A. If the 
transformation A acts between systems with the same handedness, then det A = +1 and 
(18.18) is the vector transformation formula. However, if the transformation acts between 
systems with opposite handedness, then det A = —1 and the quantity produced by (18.18) 
has the wrong sign. Three-component quantities that have this strange behaviour are 
known as pseudovectors. Pseudovectors are reasonably common. The cross product 
of any two (genuine) vectors is a pseudovector, not a vector. In particular, the moment of 
a force and the angular momentum of a body are pseudovectors. We do not wish to make 
an issue of this distinction. Instead, we will restrict all our coordinate systems to be right- 
handed. With this restriction, det A is always +1 and {@;} (and all other pseudovectors) 
transform as vectors.* We will therefore have no need to distinguish between vectors and 
pseudovectors and we will call them all ‘vectors’. 


18.5 THE INERTIA TENSOR 


Suppose the rigid body & has one of its particles held fixed at the origin O, but 
is in otherwise general motion. Then the angular momentum of 6 about O is defined by 


N 
Lo= yr x (m;0;) (18.19) 


i=1 


in the standard notation, where the position vectors {r;} are measured from the origin O. 
In Chapter 11 we found the formula for the angular momentum of a rigid body about 
its own rotation axis; this angular momentum is a scalar quantity. Now we will find the 
formula for the full vector value of Lo. 

In order that we may introduce component suffices without confusion, it is convenient 
to omit the suffix 7 in formula (18.19) and simply write 


Lo =) rx(mv), (18.20) 


* Tt is an interesting fact that angular velocity exists, as a vector, only in a space of three dimensions. Had 
the universe been created with (say) four spatial dimensions, then X and v would be (four-dimensional) 
vectors and v would be given by the formula v = 9 - x, where Q is a 4 x 4 anti-symmetric matrix 
representing a second order tensor. This matrix has six independent elements which cannot be fitted into 
a column of length four ! The same applies to any spatial dimension other than three. 
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where the sum is deemed to be taken over all the particles of B. 
Since B is a rigid body and O is fixed, the velocity v is given by v = w xr, where @ 
is the angular velocity of the body. Then 
rxv=rx(@xr) 
=(r-r)@®—-(r-@)r. 


If the typical particle P has coordinates (x1,.x2,x3) and @ has the components 
{@1, ©2, @3}, then the 7-th component of r x v can be written 


(rxv)i = (x7 +x5+ 3) oj — (x1@1 + X22 + X33) Xi 


3 3 


= ) XKXk | Wi — ) Xj Wj | Xi- 


k=1 jel 


What we want to do now is to make w; a factor of this expression, which means that we 
would like @; to be replaced by w;. There is a standard trick for doing this, that is, to write 


3 
Oi = yo Wj, 
i 


where 6;; is the Krénecker delta. [This is equivalent to pre-multiplication of the column 
vector w by the identity matrix.] We then obtain 


3 3 3 
(rxv); = ya So bij@; = So xjoj Xi 
k=1 j=l ja 
3 3 
= > So xexe bij —XiXj | @; 
j=l k=1 


It follows that if {L1, £2, £3} are the components of the angular momentum Lo, then 
3 
Lem) 1p ay, (18.21) 
j=l 


where the nine-component quantity {J/;;} is defined by 


3 
hij = om So rere bij — XiXj 
k=1 


It is easy to show directly that {/;;} is a second order tensor. For each particle of the 
body, the Cartesian coordinates {x , x2, x2} transform as a vector. It follows that the outer 
product {x;x;} is a tensor and its contraction ae XxX; 1S a scalar. It then follows by 
linearity that 34 1 xexk) dj; — xix; 1s a tensor. Since the particle masses are certainly 


invariants, it once again follows by linearity that {/;;} must be a tensor. 
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Definition 18.7 Inertia tensor The second order tensor {Ij;} defined by 


Definition of the inertia tensor 


, (18.22) 
lj = dom (x5) bij — Xi “] 


k=1 


is called the inertia tensor of the body B at the point O. Note that the inertia tensor is 
symmetric, that is, Ij; = I;;. 


We have therefore proved that, when a rigid body is rotating with angular velocity w 
about an axis through origin O, its angular momentum Lg about O is given by 


Ly Ty ty2 113 | 
Lo |} = J 1p In2 13 @2 |. (18.23) 
L3 131 [32 133 03 


This can be written in matrix form as 


Angular momentum formula 
(18.24) 
Lo =I-w 


where the column vectors Lg and w are formed from the components of Lg and @ respec- 
tively, and I is the matrix form of the inertia tensor {J;;}. Because Lo is obtained from 
w by a matrix multiplication, it follows that Lo and @ will generally lie in different 
directions. This fact (which does not show itself in planar mechanics) is what gives three- 
dimensional rigid body motion its special character. 

The inertia tensor also appears in the corresponding expression for the kinetic energy 
of B. By following a similar procedure to that used for angular momentum, it is found 
that T is given by 


Thi tio 13 @| 
T = 5 (@1 @ 3) | M21 Ino hs w2 (18.25) 
Tay I32 I53 3 


which can be written in the matrix form 


Kinetic energy formula 
(18.26) 
T= 5 w «1st 
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The elements of the inertia tensor 


The diagonal elements of I are actually familiar quantities. For example 


i= yim ((x? xe + x3) 611 — x11) 
= yim (x3 +x3) 
= Dv 


where p, is the perpendicular distance of the typical particle P from the axis Ox;. Hence 

J, is just the ordinary moment of inertia of the body about the axis Ox,. Similarly, /22 

and /33 are the moments of inertia of the body about the axes Ox2 and Ox3 respectively. 
The off-diagonal elements of I are given by 


Ie = bj == So mxix2, (18.27) 
I93 = [32 = — So mxx3, (18.28) 
hai Yo mx3x1, (18.29) 


where, as usual, the sum is taken over the particles of the body. The quantities }* mx1x2, 
> mx2Xx3, > mx3x, are known as products of inertia. Hence the off-diagonal elements 
of I are the negatives of their corresponding products of inertia.* Our results are sum- 
marised as follows: 


Elements of the inertia tensor 
e The diagonal elements of the inertia tensor, namely 111, /22, 133, are the ordi- 
nary moments of inertia of the body about the coordinate axes Ox, Ox2, Ox3 
respectively. 


e The off-diagonal elements of the inertia tensor are the negatives of their corre- 
sponding products of inertia. 


Products of inertia are new quantities and the reader will probably expect that we 
should now give several examples on their calculation. In fact, we will give just one easy 
one, which is a part of the next example. The reason is that, in almost all problems, 
the products of inertia can be inferred from symmetry and known results. Only a body 
with virtually no symmetry would require products of inertia to be calculated from first 
principles. 


* Some authors include the negative sign in their definition of product of inertia, but this seems unnatural. 
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FIGURE 18.3 The uniform rectangular plate has mass M and sides 2a and 2b 


Example 18.6 Calculating the inertia tensor 


A uniform rectangular plate has mass M and sides 2a and 26 as shown in Figure 18.3. 
Find the inertia tensor at the points G and C in the coordinate systems shown. 


Solution 

Consider first the centre of mass point G. Then /;; is the moment of inertia of the 
plate about the axis Gx; which is given in the Appendix (at the end of the book) 
to be 5M b*. Similarly, In. = 3M a? and, by the perpendicular axes theorem, /33 = 
Iyytin = 3M (a2 +b). The products of inertia are all zero. The products }* mx2x3 
and )* mx3x 1 are zero because all the mass lies in the plane x3 = 0. The product 
>= mx x2 is zero because of symmetry; the plate is symmetrical about the axis x; = 0 
but the terms of the sum are odd functions of x;. The contributions to the sum from 
the right and left halves of the plate therefore cancel. Hence /93 = J3; = [12 = 0. 
Thus, in the given coordinate system, the inertia tensor at G is represented by the 
matrix 


‘ b 0 0 
2 
0: 0 at 


Now consider the corner point C. Then /;; is the moment of inertia of the 
plate about the axis Cx, which is given by the parallel axes theorem to be 5M be + 
Mb? = 3M b?. Similarly, In. = iM a? and, by the perpendicular axes theorem, 
33 = htiIn= 4M (a? +b). The elements />3 and /3; are zero for the same 
reason as before. However, the product }* mx x is not zero now because the plate 
is no longer symmetrically placed relative to the coordinate system based at C. We 
will therefore evaluate this product of inertia from first principles. Let the plate have 
uniform mass per unit area o. Then 


2a 2b 
Yi mxixe = | / ox,x2 dx{dx2 = Aoa7b* = Mab, 
1=0 x2=0 
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since M = 4oab. Hence /;2 = Mab. Thus, in the given coordinate system, the 
inertia tensor of the plate at C is represented by the matrix 


, Ab? —3ab 0 
Ic= gM —3ab 4a? 0 a 
0 0 4(a?+4+ 5b?) 


Example 18.7 Transforming the inertia tensor 


Find Ig in the set of axes obtained by rotating the axes Gx ,x2x3 about Gx3 so that 
the new axis Gx; lies along the diagonal GD. 


Solution 

In order to achieve the required position, the axes Gx1x2.x3 must be rotated through 
the acute angle ~w = tan~!(b/a). The corresponding transformation matrix is given 
by the formula (18.8) to be 


cosa sina 0 
A=] —sina cosa 0 


0 01 


The matrix IG representing the inertia tensor at G in the rotated coordinates is given 
by the transformation formula (18.17) to be 


I, =A-Ic-A’ 


1 cosa sina 0 b? 0 0 cosa —sina 0 
=, — sina cosa 0 0 a2 0 sina cosa 0 
0 0 1 0 0 a*+b 0 0 1 
1 a? sin? a + b* cos? a (a? — b*) sina cosa 0 
= gf (a? — b*) sinacosa a* cos? a + b* sin? a 0 
0 0 a* + b? 
Py 2a*b? — ab(a? — b?) 0 
2) 2 4 4 
= 5 ab(a* —b*) a*+b 0 : 
eee) 0 0 @Wt+Py 


on inserting the values sina = b/(a* + b*)!/? and cosa = a/(a? + b*)!/2. This is 
the inertia tensor of the plate at G in the rotated coordinate system. & 


Example 18.8 Finding kinetic energy using the inertia tensor 


Suppose that the plate in Figure 18.3 is made to rotate about one of its diameters with 
angular speed i. Find its kinetic energy. 


Solution 
The kinetic energy can be evaluated by using either the original or the rotated coordi- 
nate system. 
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In the original coordinate system, 


T= 5w'-Ig-w 


bh 0 0 Acosa 
= 4M (Acosa Asina 0) 0 a2 0 A sina 
0 0 a+b 0 


= eMi? Gi cos? a ++ a’ sin? a) 
_ Ma2b?}2 
~ 3(a2 +b)" 


In the rotated coordinate system, 


T= }w'-Ig-w 


2a*b* ~~ ab(a* — b?) 0 ry 
M 2 2 4 4 
= Gat 45D (499) ab(a* —b*) a*t+b 0 0 
0 0 (a? + b*)? 0 
_ Ma?*b?)? 
 3(a2 +b?) 


Starting from scratch, the first calculation is quicker. ll 


18.6 PRINCIPAL AXES OF A SYMMETRIC TENSOR 


The nine components of a second order tensor depend on the coordinate system, 
their values in different coordinate systems being related by the transformation formula 
(18.17). The question naturally arises as to whether we can simplify the representation of 
a tensor by a clever choice of coordinate system. If a tensor is represented by the matrix 
T in one coordinate system C, then, in another coordinate system C’, it is represented by 
the matrix T’ given by 


T=A-T-A’, (18.30) 


where A is the orthogonal transformation matrix between C and C’. We therefore wish to 
find an orthogonal matrix A (with determinant +1) that makes the product A- T- A’ as 
‘simple’ as possible, preferably a diagonal matrix. The question is therefore equivalent 
to a problem in linear algebra. There are a number of theorems available concerning the 
diagonalisation of matrices by transformations resembling (18.30) (see Anton [7]) and one 
of these results is exactly what we need. It is the orthogonal diagonalisation theorem: 


Theorem 18.1 Orthogonal diagonalisation Given any real symmetric matrix T, 
there is an orthogonal matrix A such that the product A+ T - A’ is a diagonal matrix. 


Since the inertia tensor is symmetric, the above theorem applies. Thus, at any point 
O of the body, one can find a set of axes in which I is a diagonal matrix. These are called 
principal axes of the body at O. The diagonal elements of I are simply the moments of 
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inertia of the body about the three principal axes. They are called the principal moments 
of inertia at O and will be denoted by symbols A, B and C. This is summarised as 
follows: 


Principal axes and principal moments of inertia 


Relative to a set of principal axes Ox,x2x3 at O, the inertia tensor I has the 
diagonal form 


fe [0B 201. (18.31) 


where the principal moments of inertia A, B, C are the moments of inertia of the 
body about the axes Ox), Ox2, Ox3 respectively. 


Finding principal axes 


There is a standard procedure for finding the orthogonal matrix A that reduces any sym- 
metric matrix T to diagonal form in the above manner. The diagonal elements of the 
reduced matrix are the eigenvalues of the matrix T, and the rows of A contain the com- 
ponents of the corresponding normalised eigenvectors of T. However, for the purpose of 
solving problems in mechanics, we (almost) never need to carry through this procedure. 
The reason is that, in almost all problems, the orientation of the principal axes can be 
deduced by symmetry, as we will show in Section 18.7. 


Expressions for L and 7 in principal axes 


When principal axes are used, the expressions (18.24) for the angular momentum and 
(18.26) for the kinetic energy of a rigid body are much simplified. They reduce to the 
non-tensorial expressions 


Expressions for Lo and 7 in principal axes at O 
Lo = (Aa) e1 + (Ba) e2 + (C@3) €3 (18.32) 


T = 5 (Aas + Bos + Co) 


where {e1, €2, e3} are the unit vectors of the principal axes, and {@1, 2, w3} are the com- 
ponents of @ in the principal axes, that is, @ = w,e; + w2e2 + w3e3. 

These results apply to the case in which the body has a permanently fixed particle O 
(the vertex of a spinning top, for example). However, similar formulae apply to the case 
of completely general motion. In this case we have 
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Expressions for Lc and 7 in principal axes at G 
Lg = (Aq) e; + (B a2) e2 + (C@3) €3 (18.33) 


TG 


5 (Aw? + Ba; + Ca) 


where G is the centre of mass of the body. Here Lg is the angular momentum of the body 
about G, and T is the rotational kinetic energy of the body about G. 

These are the results that you really need in order to solve problems in rigid body 
dynamics. Almost always, principal axes are used and only the equations (18.32) or 
(18.33) are required. Thus, it is possible to solve most problems in rigid body dynam- 
ics without any knowledge of tensors at all! 


18.7 DYNAMICAL SYMMETRY 


Finding principal axes from geometrical symmetries of the body 


The directions of the principal axes of the inertia tensor of a body at a point O can often 
be inferred from the geometrical symmetry of the body about O. The following rules 
(which are consequences of the tensor transformation formula) are useful. 


Rule 1: /f the body has reflective symmetry in a plane through O, then the line through 
O perpendicular to this plane is a principal axis. 


Rule 2: If the body has any rotational symmetry about an axis through O, then this axis 
is a principal axis. 


Either of these rules is enough to show that, for a uniform rectangular plate, the 
axes shown in Figure 18.3 (left) are a set of principal axes at G. The plate has reflective 
symmetry in each of the coordinate planes, and also has rotational symmetry of order two 
about each of the coordinate axes. The set of parallel axes at C is not a principal set. For 
a uniform cone on a pentagonal base (see Figure 18.4 (left)), the axis of symmetry is a 
principal axis at G and there are five other principal axes through G. [What are they?] For 
a spinning top (see Figure 18.4 (centre)), the axis of symmetry is a principal axis at O 
and so is any axis through O that is perpendicular to it! We seem to be finding too many 
principal axes, but this will soon be explained. 


Dynamical symmetry 


Consider a rigid body B pivoted at the origin O. Then the motion of B under known forces 
is determined by the angular momentum principle, which, in view of equation (18.24), 
can be written 


d 
— (I . —=K 
7p ro 4) O 


where K og is the total moment about O of the external forces acting on B. In principal 
axes, I 9 contains only the three principal moments of inertia A, B and C and so the shape 
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aly, 
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FIGURE 18.4 Three bodies with dynamical axial symmetry. 


of the body enters into the equations of motion only by the values of these three numbers. 
Under the same set of forces and initial conditions, different bodies that have the same 
principal moments of inertia have the same motion. 

The form of the motion, and the chance of solving problems, is very much influenced 
by the presence or absence of dynamical symmetry. There are three cases: 


A, B,C all different 


When {A, B, C} are all different, the body is said to be dynamically unsymmetric, and, 
as might be expected, this is the most difficult case to treat analytically. For the rectangular 
plate shown in Figure 18.3, the principal moments of inertia at G are A = 5M b*, B= 
5M a? andC = 3M (a +b?), which are all different. Thus, despite the apparently simple 
shape of the plate, it is no easier to treat than a lump of scrap metal. In the unsymmetrical 
case, the principal axes at O are essentially unique.* 


A and B equal, C different 


Suppose now that two of the principal moments of inertia (A and B say) are equal, the 
third, C, being different. The body is then said to be dynamically axially symmetric 
about the axis Ox3. This time, the axis of symmetry is a principal axis, and the other 
two axes can be chosen arbitrarily, provided that they form a right-handed orthogonal set. 
One might expect this to occur only when the body is a body of revolution about an axis 
through O, as in the case of a traditional spinning top. However, the following remarkable 
result shows that this is not neccessary. 


Rule 3: Jf a body has a rotational symmetry about an axis through O, and the order" of 
this symmetry is three or more, then the body has dynamical axial symmetry about this 
axis. 


* The principal axes are unique in the sense that the three orthogonal unlabelled lines in space are unique. 
There are then 24 ways of labelling these lines to create a right-handed system of coordinates. 

+ If the mass distribution of a rigid body is left unchanged when the body is rotated through an angle 2z/n, 
then the body is said to have rotational symmetry of order n about the rotation axis. Thus a uniform 
lamina having the shape of a regular pentagon has rotational symmetry of order five about the axis through 
G perpendicular to its plane. 
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FIGURE 18.5 Three bodies with dynamical spherical symmetry. 


The pentagonal cone in Figure 18.4 has rotational symmetry of order five and so Rule 3 
applies. In particular, it follows that the cone has the same moment of inertia about any 
axis through G pependicular to the symmetry axis! 

Interestingly, it is possible for a body to have dynamical axial symmetry and yet have 
no rotational symmetry at all. The body in Figure 18.4 (right) has no rotational symmetry, 
but, since it is made up of three bodies that do have such symmetry, it must be dynamically 
axially symmetric! 


A, B and C all equal 


Suppose now that all three of the principal moments of inertia at O are equal. The body is 
then said to be dynamically spherically symmetric about O. This time, the inertia tensor 
is a multiple of the identity and any axis through O is a principal axis. The principal axes 
can therefore be chosen arbitrarily, provided that they form a right-handed orthogonal set. 
A uniform ball is obviously dynamically spherically symmetric, but there are many other 
examples. This follows from our last rule, Rule 4. 


Rule 4: [fa body has two different axes of dynamical axial symmetry through O, then it 
must be dynamically spherically symmetric about O. 


Figure 18.5 shows three bodies that have dynamical spherical symmetry about their 
centres of mass. None of these bodies is a ball, but they move as if they were. For 
example, the uniform tetrahedron has four different axes of dynamical axial symmetry 
through G and so, by Rule 4, it must be dynamically spherically symmetric about G. As 
a consequence, it must have the same moment of inertia about any axis through G. [Don’t 
try proving this any other way! ] 
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Problems on Chapter 18 


Answers and comments are at the end of the book. 


Harder problems carry a star (*). 


Orthogonal transformations 


18.1 Show that the matrix 
2 6 
-6 3 2 
6-3 


is orthogonal. If A is the transformation matrix between the coordinate systems C and C’, do C 
and C’ have the same, or opposite, handedness? 


18.2 Find the transformation matrix between the coordinate systems C and C’ when C’ is 
obtained 


(1) by rotating C through an angle of 45° about the axis O-x2, 
(ii) by reflecting C in the plane x2 = 0, 


(iii) by rotating C through a right angle about the axis OB, where B is the point with 
coordinates (2, 2, 1), 
(iv) by reflecting C in the plane 2x, — x2 + 2x3 = 0. 


In each case, find the new coordinates of the point D whose coordinates in C are (3, —3, 0). 


18.3 Show that the matrix 


2-1 -—2 
A=-|2 2 1 
1—2 2 


is orthogonal and has determinant +1. Find the column vectors v that satisfy the equation 
A-v =v. If A is the transformation matrix between the coordinate systems C and C’, show 


nae 
that A represents a rotation of C about the axis OE where E is the point with coordinates 
d, 1,-1) inc. 

* Find the rotation angle. 


Tensor algebra 


18.4 Write out the transformation formula for a fifth order tensor. [The main difficulty is 
finding enough suffix names! ] 


18.5 In the coordinate system C, a certain second order tensor is represented by the matrix 
101 
T=/]010 
101 


Find the matrix representing the tensor in the coordinate system C’, where C’ is obtained 
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(i) by rotating C through an angle of 45° about the axis Ox1, 
(ii) by reflecting C in the plane x3 = 0. 


18.6 The quantities ¢;;, and u;j;,; are third and fourth order tensors respectively. Decide if 
each of the following quantities is a tensor and, if it is, state its order: 


3 
(i) CijkUImnp (ii) Cijktimn (iii) yy 
j= 
3 3 3 
(iv) yt (v) > fe (vi) So tijatlcimn 
j=l i=1 k=1 


3. 3 


3 3 3 3 . 
(vii) SY aay (viii) )o weir (ix) Dy Dy tte. 
k=1 


i=1 j=1 i=1 j=1 k=1 


18.7 Show that the sum of the squares of the elements of a tensor is an invariant. [First and 
second order tensors will suffice.] 


18.8 If the matrix T represents a second order tensor, show that det T is an invariant. [We 
have now found three invariant functions of a second order tensor: the sum of the diagonal 
elements, the sum of the squares of all the elements, and the determinant. ] 


18.9 In crystalline materials, the ordinary elastic moduli are replaced by c;;,;, a fourth order 
tensor with eighty one elements. It appears that the most general material has eighty one elastic 
moduli, but this number is reduced because c; jx; has the following symmetries: 


(1) C jikl = Cijkl (ii) Ci jk = CijKl (til) Cktig = Cijkl 
How many elastic moduli does the most general material actually have? 


Inertia tensor and principal axes 
The following problems do not require moments or products of inertia to be evaluated by integration. 


18.10 Show that /;o,n;, the moment of inertia of a body about an axis through O parallel to 
the unit vector n, is given by 


Tony = n’. Io -n, 


where Ig is the matrix representing the inertia tensor of the body at O (in some coordinate 
system), and n is the column vector that contains the components of n (in the same coordinate 
system). 

Find the moment of inertia of a uniform rectangular plate with sides 2a and 2b about a 
diagonal. 


18.11 Find the principal moments of inertia of a uniform circular disk of mass M and radius 
a (i) at its centre of mass, and (ii) at a point on the edge of the disk. 


18.12 A uniform circular disk has mass M and radius a. A spinning top is made by fitting 
the disk with a light spindle AB which passes through the disk and is fixed along its axis of 
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symmetry. The distance of the end A from the disk is equal to the disk radius a. Find the 
principal moments of inertia of the top at the end A of the spindle. 


18.13 A uniform hemisphere has mass M and radius a. A spinning top is made by fitting the 
hemisphere with a light spindle AB which passes through the hemisphere and is fixed along its 
axis of symmetry with the curved surface of the hemisphere facing away from the end A. The 
distance of A from the point where the spindle enters the flat surface is equal to the radius a of 
the hemisphere. Find the principal moments of inertia of the top at the end A of the spindle. 


18.14 Find the principal moments of inertia of a uniform cube of mass M and side 2a (i) at 
its centre of mass, (ii) at the centre of a face, and (iii) at a corner point. 

Find the moment of inertia of the cube (i) about a space diagonal, (ii) about a face diagonal, 
and (iii) about an edge. 


18.15 A uniform rectangular block has mass M and sides 2a, 2b and 2c. Find the principal 
moments of inertia of the block (i) at its centre of mass, (ii) at the centre of a face of area 4ab. 
Find the moment of inertia of the block (i) about a space diagonal, (i1) about a diagonal of a 
face of area 4ab. 


18.16 Find the principal moments of inertia of a uniform cylinder of mass M, radius a and 
length 25 at its centre of mass G. Is it possible for the cylinder to have dynamical spherical 
symmetry about G? 


18.17 Determine the dynamical symmetry (if any) of each the following bodies about their 
centres of mass: 
(i) a frisbee, 
(11) a piece of window glass having the shape of an isosceles triangle, 
(iii) a two bladed aircraft propellor, 
(iv) a three-bladed ship propellor, 
(v) an Allen screw (ignore the thread), 
(vi) eight particles of equal mass forming a rigid cubical structure, 
(vil) a cross-handled wheel nut wrench, 
(viii) the great pyramid of Giza, 
(ix) a molecule of carbon tetrachloride. 


18.18% A uniform rectangular plate has mass M and sides 2a and 4a. Find the principal axes 
and principal moments of inertia at a corner point of the plate. [Make use of the formula for 
Ic obtained in Example 18.6, with b = 2a.] 


If you know how, do this question by finding the eigenvalues and eigenvectors of Ic. If not, try the following 
homespun method: Starting from I¢ in the coordinates used in Example 18.6, find IG in the coordinate 
system obtained by rotating through an angle a around the axis Cx3. Then choose a to eliminate the 


off-diagonal elements. 
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Problems in rigid body dynamics 


KEY FEATURES 

The key features of this chapter are the use of the linear and angular momentum principles 
to generate the equations of rigid body motion; the importance of body symmetry in sim- 
plifying the problem; and the choice of solution method, vectorial, Lagrangian or Eulerian. 


Readers who have reached this point have come a long way and deserve to be congrat- 
ulated. With the linear and angular momentum principles and the inertia tensor behind us, 
we are now able to solve some of the most interesting and puzzling problems in mechan- 
ics. Rigid body motion is the pinnacle of achievment* in the ‘problem solving’ kind of 
mechanics but, given the background we now have, many problems are surprisingly easy. 
In this chapter, we classify problems by the dynamical symmetry of the body; first we con- 
sider spherically symmetric bodies, then axially symmetric bodies and finally unsym- 
metric bodies. Most problems can be done by either vectorial or analytical methods and 
we choose whatever is appropriate in each case. 


19.1 EQUATIONS OF RIGID BODY DYNAMICS 


Rigid body in general motion 


dV 
Translational motion of G : M oe =F (19.1) 
dL 
Rotational motion about G : ae =Kg (19.2) 


* More advanced topics exist, but they are more concerned with ‘grand general principles’ than problem 
solving! 
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The governing equations of rigid body dynamics are the linear and angular momen- 
tum principles in their centre of mass form, as shown above; the notation is that used in 
Chapters 10 and 11. The linear momentum principle (19.1) determines the translational 
motion of the centre of mass G, and the angular momentum principle (19.2) determines 
the rotational motion of the body relative to G. 

The above equations could be applied in all cases, but, when the body is pivoted* 
at a fixed point O, it is more convenient to drop the first equation and apply the angular 
momentum principle about the pivot point O instead of G. With this choice, the unknown 
reaction at the pivot is eliminated. We then have: 


Rigid body pivoted at a fixed point O 


dL 
Rotational motion about O : = =Ko (19.3) 


Calculating Lg and Lg in principal axes 


In order to use the equation (19.2) (or (19.3)) we need to express Lg (or Lg) in terms of 
@, the angular velocity of the body. In general, this is a tensor relation, but, in principal 
axes, it reduces to the vector form obtained in Section 18.6. The results are as follows: 

Suppose Gx ,x2x3 are principal axes at G with associated unit vectors {e1, e2, e3}. 
Then 


Lg = Aa, e; + Barer + Cw3e3 (19.4) 


where A, B,C are the principal moments of inertia of the body about the axes Gx,, Gxo, 
Gx3 respectively. Similarly, the angular momentum Lg can be expressed in terms of the 
angular velocity @, relative to principal axes of the body at O, by the formula 


Lo = Aa, e; + Bager + Cw3e3 (19.5) 


where A, B, C are now the principal moments of inertia of the body about the axes Ox1, 
Ox2, Ox3 respectively. In each case, {w1, w2, w3} are the components of @ in the principal 
axes, that is, 


@=@1e€} +a2€2+ W363. 


Dynamical symmetry of the body 


The most important general feature of a moving rigid body is its dynamical symmetry 
(see Section 18.7). The more symmetry the body has, the simpler is its motion and the 


* The spinning top with its vertex held at a fixed point is a typical example. 
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FIGURE 19.1 In the motion of the Earth, its angular velocity is preserved 
and is not normal to the plane of the orbit. This gives rise to the seasons 
(shown here for the northern hemisphere). 


easier it is to calculate it. We begin with the simplest case, which is when the body has 
dynamical spherical symmetry. Remember that, in order to have dynamical spherical 
symmetry, the body need not actually be spherical; indeed, one can find bodies that have 
no geometrical symmetry at all that are dynamically spherical. 


19.2 MOTION OF ‘SPHERES’ 


Suppose the body has dynamical spherical symmetry about its centre of mass 
G. The body could be a uniform sphere, but it does not have to be. All we require is that 
A = B =C atG, in which case the body has the same moment of inertia A about every 
axis through G. Equation (19.4) then simplifies to give 


Lg = Aa) e| + A@2 e2 + Aw3e3 = A (a1 e] + @2€2 + 33) = Ao, 
so that LG is simply proportional to w. Equation (19.2) then becomes 
Aw= Kg. 


If the body is moving under no forces, or under uniform gravity, then the total moment 
Kg = 0 and we obtain 


@ = 0, 


a constant. Thus, in either of these cases, the angular velocity of the sphere is preserved 
in the motion. Meanwhile, the motion of G, as determined by equation (19.1), is either 
straight line or parabolic motion respectively. Hence, when one throws a ball (or a cube!), 
G traces out the usual parabola and @ retains the value that it was given initially. Note 
that @ retains its direction as well as its magnitude, so that even though the direction of V 
changes, the rotation axis maintains a constant direction in space. 
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—Mgk 


FIGURE 19.2 The snooker ball is a uniform sphere of mass M 
and radius b that moves in contact with a horizontal table. 


The most important instance of this is the motion of the Earth in its orbit. If we assume 
that the Sun and the Earth are spherically symmetric bodies,* then the total moment of 
forces Kg exerted on the Earth by the Sun is zero and @ is preserved. Since @ is not 
normal to the plane of the orbit, the motion looks like Figure 19.1 (not to scale!), and this 
gives rise to the seasons. 


19.3 THE SNOOKER BALL 


Consider a snooker or pool ball, with mass M and radius b, that moves in contact 
with a horizontal table (see Figure 19.2). First we will consider general motion of the 
ball. In particular, we will allow skidding so that there is no special relation between V, 
the velocity of G, and the angular velocity @. The table is rough’ so that the reaction X 
exerted by the table on the ball is not necessarily vertical. The governing equations (19.1) 
and (19.2) become 


MV =X—Mgk 
Lg = (—-bk)xX, 


where Lg = Aq. On eliminating the reaction X we find that 


A@ = b (MV + Mgk) xk 
= MbV xk. 


* This is a very good approximation but (like everything else in astronomy) it is not exact. The Earth is 
slightly spheroidal and the gravitational fields of the Sun and Moon give rise to a small resultant moment 
about G. As a consequence, w actually precesses very slowly around the normal to the plane of the orbit 
(once every 25,730 years). 

+ But not so rough that the ball is compelled to roll. 


526 Chapter 19 Problems in rigid body dynamics 
On integrating with respect to t, we obtain the non-standard conservation principle* 
A@+MbkxV=C, (19.6) 


where C is a constant vector. This is true in any motion of the ball, whether skidding or 
rolling. 

In particular, if we take the scalar product of this equation with k, we obtain the scalar 
conservation principle 


o-k=n, (19.7) 
where n is a constant. Thus the vertical component of w is conserved in any motion of the 
ball. 

We now examine the special case of rolling. In this case the contact particle C has 
zero velocity, so that the rolling condition is 
V+@x(—bk) =0, 
that is, 
V+bkxo=0. (19.8) 


Now, from the conservation principle (19.6), it follows that 


Akxw=kxC — Mbkx(kxV) 
=kxC —Mb((k-V)k—(k-k)V) 
=kxC+MbV, 


since k is a unit vector and is perpendicular to V. On substituting this result into the 
rolling condition (19.8) we obtain 


Mb? b 
V+{—]V={|-]Cxk, 
A A 
which shows that V must be constant in any rolling motion. 
The corresponding value of @ is 


1 
=e ok (19.9) 


which is also constant. Hence: 


* Tt is actually the angular momentum principle applied about the non-standard point C. 
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FIGURE 19.3 The free motion of a body with 
dynamic axial symmetry, depicted here as a 
square plate. The principal axis G3 is the 
axis of symmetry and a (= a(t)) is the axial 
unit vector. 


The only rolling motions that are possible are straight line motions with constant V 
and constant @; this is despite the presence of the vertical angular velocity component n. 


This fact makes the games of pool and snooker possible. In practice, it is difficult to 
strike a ball and not give it some ‘side’ (a non-zero value of 7). If rolling took place along 
curved lines when the ball had ‘side’, the game would be impossibly difficult. As it is, 
players often give the ball plenty of ‘side’ deliberately. The ball still rolls in a straight line, 
but the ‘side’ affects the bounce of the ball when it hits a cushion; good players use this 
fact to control the ball. As a corollary, it follows that, if a player actually wants to swerve 
a ball, then he must make the ball skid on the table. 


19.4 FREE MOTION OF BODIES WITH AXIAL SYMMETRY 


Bodies with dynamical axial symmetry are the most important objects whose 
motion we study. This is for three reasons: (i) They are much more common than spheri- 
cally symmetric objects, (ii) they perform more interesting motions than spherically sym- 
metric bodies, (iii) their motions can often be calculated in closed form. 

Consider a rigid body with dynamical axial symmetry. It could be a baseball bat, a 
pencil, a top, a spacecraft, a freely-pivoted gyroscope, or the square plate shown in Figure 
19.3. The centre of mass G of such a body must lie on the axis of symmetry and we 
will suppose that the principal moments of inertia of the body at G are {A, A, C}, with C 
being the moment of inertia about the symmetry axis. C may be greater or less than A. 
For example, if the body is a short fat cylinder, then C > A, but, if the the body is a long 
thin cylinder, then C < A. 

Consider such a body moving under either (i) no forces, or (ii) uniform gravity. 
Then the motion of G is given by particle mechanics and it only remains to calculate the 
rotational motion of the body relative to G. Since Kg = 0 in each of these cases, the 
equation of rotational motion is Lg = 0, where, in the principal axes Gx1x2x3, 


Lg = Aa, e; + Aw2e2 + Cw3e3. (19.10) 
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It should be remembered that the principal axes Gx\x2x3 move with the body and 
that the unit vectors {e1, €2, e3} are therefore functions of the time. 


In what follows we will denote the axial unit vector e3 by a (= a(t)); this is simply 
to improve readability. We first obtain an expression for the angular velocity @ in terms 
of a. Consider the point of space that has position vector a relative to G. Since the axial 
vector a moves with the body, this is the position vector of the particle A of the body that 
is at unit distance from O along the axis Gx3. (If this point lies outside the body, then it 
can simply be included as an extra particle of zero mass.) It follows that a, the velocity of 
A, is given by 


a=oxa. 
If we now take the vector product of this equation with a, we obtain 


axa =ax(@xa) 


(a-a)@—(a-@)a 


=o-—(a-o)a. 
Hence @ must have the form 
@=axathka, (19.11) 


where A (= @ - a) is a scalar function of the time. 

The second step is to calculate the corresponding angular momentum Lg. In the 
expression (19.11), the term Aa lies in the direction of the principal axis G3 and the term 
a xX a@ is perpendicular to a and is therefore also in a principal direction. It follows from 
the formula (19.10) that the corresponding angular momentum Lg is given by 


Lg = Aaxa+Cha. (19.12) 


The equation for the rotational motion of the body about G is therefore 


< (Aaxa+Cha)=0, (19.13) 
that is, 

Aaxa+C(ha+ha) =0. 
If we now take the scalar product of this equation with a, we obtain 


Aa+(ax@)+C(A(a-a) +A(a-a)) = 0. 


Now the triple scalar product a - (a x a) has two elements the same and is therefore zero; 
also, since a is a unit vector, a-a = 1anda-a =O. It follows that A = 0, thatisA =n,a 
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constant. Hence the axial component of @ is a constant. We will call this axial component 
of the spin of the body. 
The equation of motion for the body is therefore 


Equation of motion for a free axisymmetric body 
(19.14) 
Aaxa+Cna=Lg 


where the spin 7 (= @ - a) and the angular momentum Lg are constants determined by 
the initial conditions. 

Surprisingly, this equation has a simple exact solution. First, let us write Lg = Lk, 
where L is the magnitude of Lg and k is the unit vector in the same direction as Lg, as 
shown in Figure 19.3. Then, on taking the scalar product of equation (19.14) with a, we 
obtain 


Aa:(axa)+Cna-a=L(a-k), 
which simplifies to give 
Cn=L(a-k) =Lcosa, 


where @ is the angle between a and k.* It follows that @ is constant and that n, the constant 
axial component of @, is given by 


Lecosa 
n= : (19.15) 
C 


Thus the axis of symmetry of the body makes a constant angle with k and so sweeps out a 


cone around the axis {G, k}. 


The progress of the axis of symmetry in time can be found by taking the vector product 
of equation (19.14) with a. This gives 


Aax(axa)+Cnaxa=L(axk), 
that is, 
A((a+a)a—(a-a)a)+0=L(axh). 


Since a is a unit vector,a-a = 1anda-a =O and we obtain 


, = ay 19.16 
a= ah) xa, ( . ) 


This equation shows that the axis of symmetry of the body precesses around the axis {G, k} 
with constant angular speed L/A. 


* We will suppose that 7 is positive so that a is an acute angle. 
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Motion viewed from the precessing frame 


It is instructive to view this motion from a rotating frame in which the axis of symmetry 
is at rest. The true angular velocity of the body is given by 


@=axat+na 


_ La ) Locosa 
= ax (Fc <a)) + ( C Ja 


z k+L aa (19.17) 
=> — COs —. 2 ‘ 
A “\ae )* 


Suppose we now view the motion from a frame with origin at G that is rotating about 
the axis {G, k} with angular speed (L/A)k. In this precessing frame, the axial vector a 
is at rest. Also, Theorem 17.1 (on the addition of angular velocities) tells us that, in the 
precessing frame, the apparent angular velocity of the body is 


A-C 
w = Lcosa | ——— Ja, 
AC 


that is, the body apparently rotates about its fixed axis of symmetry with angular velocity 
Lcosa(A —C)/AC. The true motion is thus composed of (i) this constant axial rotation, 
and (ii) a precession around the axis {G, k} with constant angular speed L/A. 

Our results are summarised as follows: 


Free motion of an axisymmetric body 


e The axis of symmetry of the body makes a constant angle a with the angular 
momentum vector L k and precesses around the axis {G, k} (in the positive sense) 
with constant angular speed L/A. 


e In this motion, the axial spin n of the body has the constant value L cos a/C. 


e In the precessing frame, the body apparently rotates about its fixed axis of sym- 
metry with constant angular velocity L cosa(A —C)/AC. 


The motion described above is very familiar, at least qualitatively. The precession of 
the symmetry axis looks like a ‘wobble’ superimposed on the spinning motion of the body 
about the symmetry axis. 


Example 19.1 Wobble on a frisbee 


A frisbee is spinning with constant angular speed Q about its axis of symmetry when 
its motion is slightly disturbed. What is the angular frequency of the resulting wob- 
ble? 


Solution 


Suppose that the new angular momentum of the frisbee is Lk with the axis of the 
frisbee making a small angle w with the fixed axis {G, k}. Then the axis of the frisbee 
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FIGURE 19.4 A symmetrical top with its vertex fixed at O. The top moves under uniform 
gravity and the reaction force X of the table. 


precesses around the axis {G, k} with angular speed L/A. It is this precession that 
is observed as a wobble. The angular momentum magnitude L is not known exactly, 
but it differs only slightly from that in the undisturbed state, namely CQ. Hence 
the precession rate is CQQ/A approximately, and this is the angular frequency of the 
wobble. If we regard the frisbee as a circular disk, then C = 2A and the angular 
frequency of the wobble is 202, approximately twice the spin of the frisbee about its 
axis. 


The above results can also be obtained by using Lagrangian mechanics. The method 
is described in Section 19.6. 


19.5 THE SPINNING TOP 


Consider the symmetrical spinning top* shown in Figure 19.4 (left), which has its 
vertex fixed at the origin O. One may imagine that the vertex of the top is lodged in a 
small smooth pit in the table from which it cannot escape. The top moves under uniform 
gravity and the unknown reaction force X exerted by the table. Since the top has a particle 
fixed at O, we take the equation of motion in the form 


Lo =Ko, 
where K o is the moment of the external forces about O. In the present case, 


Ko =0xX + (ha) x (—Mgk) 
= —Mghaxk, 


* A ‘top’ should be understood to mean any rigid body with dynamical axial symmetry that is pivoted at a 
point on its symmetry axis, and is moving under uniform gravity. 
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where M is the mass of the top and h is the distance OG. 
The angular momentum Lg is calculated exactly in the same way as in the last section. 
The angular velocity @ can be expressed in the form 


@=axat+ka, 
and Lg is then given by 
Lo = Aaxa+Cia, 


where {A, A, C} are the principal moments of inertia of the top at the vertex O, with C 
being the moment of inertia of the top about its axis of symmetry. The equation of motion 
for the top is therefore 


< (Aaxa+Cha) = —Mghaxk, 
that is 
Aaxa+C(asa +aa) = —Mghaxk. 
If we now take the scalar product of this equation with a, we obtain 
Aa: (axa@) +C(A(a- a) +A(a- a)) = —Mgha- (axk). 


The triple scalar products are both zero since each has two elements the same. Also, since 
ais a unit vector,a- a= 1anda-a =O. It follows that L= 0, so that 2 = n, a constant. 
Hence the axial component of @ is a constant. We call this axial component of the spin 
of the top. The equation of motion for the top can therefore be written 


Equation of motion for the top 


(19.18) 


d 
a (Aaxa + Cra) = —Mghaxk 


where the spin n (= @ - a) is a constant determined by the initial conditions. 


Steady precession 


We will calculate the general motion of the top in the next section by using the Lagrangian 
method. Here we will investigate only the special, but important, motion called steady 
precession. In this motion, the axial vector a maintains a constant angle a with the vertical 
and precesses round the axis {O, k} with constant angular speed Q, as shown in Figure 
19.4 (right). In this steady precession the rate of change of a is given by 


a = (Qk) xa. (19.19) 
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We will now investigate whether steady precession of the top can actually occur. From 
equation (19.16), it follows that, in steady precession, 


axa=ax(Qkxa) 
=Q(a-a)k—Q(a-k)a 
= Q(k —cosaa). 


Hence 


d 

qx = Q(0 — cosaa) 

—Qcosa(Qk) xa 

= 0? cosa(axk). (19.20) 


On substituting equations (19.20) and (19.19) into the equation of motion (19.18), we 
obtain 


(Acosa2? — Cn 2+ Mgh) (axk) =0. 


This is the condition for steady precssion of the top. Since axk 4 0, steady precession 
can only occur if there are values of aw and Q that satisfy the equation 


AcosaQ? —CnQ+Mgh =0. (19.21) 
This quadratic equation will have real roots for the angular rate Q if 
C*n* > 4AM ghcosa. (19.22) 


If the angle a is obtuse, then this condition always holds, even when n = 0. However, 
the case in which the top executes a conical type of motion below the pivot O is not 
interesting. The interesting case is that in which @ is an acute angle and the top precesses 
in an ‘upright’ position. Our result is as follows: 


The top can undergo steady precession at an acute angle a to the upward vertical 
if n (the axial component of @) is large enough to satisfy the condition (19.22). If this 
condition is satisfied there will generally be two different values of the precession rate for 
each choice of the angle a. 


Fast and slow precession 


The two solutions of equation (19.21) for the precession rate Q2 are 


Cnt (or — 4AM ghcos a)'? 


2Acosa 


‘6 4AM ch We 
Ch i (1- Segoe =") |: (19.23) 


QF s 


C2n2 
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where the fast (F’) precession rate corresponds to the ‘plus’ choice and the slow precession 
rate (S') corresponds to the ‘minus’ choice. In practical circumstances the value of the spin 
nis often such that the dimensionless ratio 


4AMgh 
C2n2 
is small compared to unity. In this case, Q* and QS are given approximately by 


Cn Gsm en 


OF & 
Acosa’ Cn 


(19.24) 


The fast precession rate is approximately directly proportional to n and is indepen- 
dent of the gravitational acceleration! In fact, it approximates the force-free precession 
found in the last section. This motion is almost impossible to observe in a real top. It 
has a precession rate of similar magnitude to n which would make the vertex of the top 
extremely difficult to secure. The fast precession may however explain the trembling 
motion sometimes seen when a top is spinning slowly with its axis almost vertical. 

In contrast, the slow precession rate is approximately inversely proportional to n and 
independent of the angle a. This is the motion commonly observed; the faster the spin of 
the top, the slower the rate of precession. 


Example 19.2 A simple top 


A top is made by sticking a light pin of length 3 cm through the centre of a uniform 
circular disk of mass M and radius 8 cm. Find the rates of slow and fast precession 
of the top when it is given a spin of 10 revolutions per second and @ is small. [Take 
g=10ms-.] 


Solution 

For a uniform disk of mass M and radius a, the principal moments of inertia at its 
centre of mass are { iM a iM a”, 5M a} and so , by the theorem of parallel axes, 
the principal moments at the point of the pin are A = B = iM (a* + 4h), C = 
5M a’, where h is the length of the pin. Hence, on putting in the given dimensions, 
A = B = 25M/10000 and C = 32M /10000, where M is the mass of the disk. We 
are also given that n = 20z radians per second and that a is small. The quadratic 
equation for the precession rate Q is therefore 


25M 32M 30M 
QQ? Ox —_ 90, 
10000 10000 nace 100 


The roots of this equation are QS = 1.52 and Q* = 78.9 radians per second. The 
approximate formulae in equation (19.24) give 1.49 and 80.4 respectively. [In this 
case the dimensionless ratio 44M gh /(C*n*) © 0.074 so a reasonable approximation 
could be expected.] 
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G 


FIGURE 19.5 Left: The Euler angles 6, ¢@ and w. Right: The corresponding ‘velocity’ diagram. 


19.6 LAGRANGIAN DYNAMICS OF THE TOP 


The Euler angles 


In order to solve problems in rigid body dynamics by Lagrangian mechanics, we need a 
set of generalised coordinates for a generally moving rigid body. The position of the 
centre of mass G can be specified by its three Cartesian coordinates, but the position of 
the body relative to G still needs to be specified. The best practical coordinates for this 
purpose are the Euler angles* 6, ¢ and y shown in Figure 19.5 (left). In order to fix the 
position of the body, it is sufficient to specify the positions of two more of its particles, A 
and B say, chosen so that G, A, B do not lie in a straight line. The position of A is fixed 
by the ‘polar angles’ 6 and ¢, measured relative to the fixed Cartesian coordinate system 
Gxyz. This does not yet fix the position of B since the rigid triangle GAB can still rotate 
around GA. The position of B becomes becomes fixed when we specifiy the angle w 


* Since 


through which the triangle GAB has been rotated from some reference position. 
the angles 0, @, yy are clearly independent variables, it follows that the Euler angles 0, ¢, 


w are a Set of generalised coordinates for the rotational motion of a rigid body. 


The Euler angles are a particularly appropriate set of coordinates when the body has 
axial symmetry. In this case it is usual to take the A to be a particle lying on the symmetry 
axis. Then @ and ¢ determine the orientation of the symmetry axis and w is the rotation 
angle of the body around the symmetry axis. 

We may now construct the velocity diagram corresponding to this choice of coordi- 
nates. When the coordinates 0, ¢, y increase individually, each gives the body an angular 
velocity as shown in Figure 19.5 (right). The angular velocity of the body in general 
motion is then obtained by adding these three angular velocities vectorially. 


* After Leonhard Euler (1707-1783), the greatest Swiss mathematician, and the most prolific of all time. 
+ The triangle GAB is in its standard reference position when it lies in the plane AGz with B ‘below’ the 
line GA. The configuraton shown in Figure 19.5 thus has a y value of nearly 277. 
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w+ cos 


FIGURE 19.6 The angular velocity of the top resolved along 
three perpendicular principal directions. 


Lagrangian for the top 


For a top with its vertex fixed at the origin O, we take a fixed system of Cartesian coor- 
dinates Oxyz with Oz vertically upwards and define the Euler angles as shown in Figure 
19.5 (left) with G replaced by O. Since the top has axial symmetry we also take A to lie 
on the symmetry axis, which is a principal axis of the top at O. The top is thus a holo- 
nomic system and can be solved by Lagrangian method. To find the kinetic energy of the 
top in terms of the Euler angles, the angular velocities in Figure 19.5 need to be resolved 
along three perpendicular principal axes, as shown in Figure 19.6.* The kinetic energy 
of the top is therefore given by 


T =1A6? +44 (bsind)’ + 4C (+ bc0s6)’, (19.25) 


where {A, A, C} are the principal moments of inertia of the top at O. The gravitational 
potential energy of the top is simply 


V = MgZ = Mghcos8, 


where /i is the distance OG. Hence the Lagrangian of the top, with the Euler angles as 
coordinates, is: 


Lagrangian for the top 
(19.26) 
L= 5A6? + 5A (¢ sin0)” + 5C (W + cos)” — Mghcosé 


Conserved quantities 


Since @ and y appear in the Lagrangian only as ¢ and yj, they are both cyclic coordi- 
nates. It follows that the generalised momenta pg and py are constants of the motion. On 


* These principal axes are not ‘embedded’ in the top since they do not rotate as w increases. 
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evaluating pg (= dL / dp) and py (= dL/d w), these conservation principles become 


Agsin? 6 + C(W + ¢cos 0) cos = L-, (19.27) 


C(W + ¢cos 6) = Cn, (19.28) 


where n and L, are constants, determined by the initial conditions. In fact, L, is the z- 
component of the angular momentum Lo, and n is the axial component of the angular 
velocity @, the quantity known as the spin of the top.* Without losing generality, we will 
assume that n is positive. 

These two conservation principles, together with the energy conservation equation 


5A6” + 5A(bsind)* + 4C(h + bcos 0)” + Mghcosé = E, (19.29) 


are sufficient to determine the motion of the top. The energy conservation equation is 
preferable to the Lagrange equation for the coordinate 0 because it contains only first 
derivatives of the Euler angles. 


The equation for the inclination angle 0 


By making use of the conservation equations (19.27) and (19.28) in the energy equation 
(19.29), we can eliminate d and W to give 


(L, — Cncos@)? 
Asin? 0 


A@? = 2E —Cn? 2M ghcos6, (19.30) 
where the constants n, L,, E are determined by the initial conditions. This ODE for the 
unknown function 6(t) has the form 


ee F (cos 6) 
Aen? Dp’ 
A? sin* 0 
where the function F'(u) is defined by 


F(u) = A(2E — Cn*)(1 — u’) — (Lz —Cnu)? — 2AMghu(1 — uv’). 


This equation is similar to the corresponding equation in the theory of the spherical pen- 
dulum (see Chapter 11) and we can make similar deductions about the behaviour of the 
coordinate 6. Because the left side of (19.30) is positive, it follows that the motion is 
restricted to those values of @ that make the function F'(cos @) positive. Moreover, the 
maximium and minimum values of @ can only occur when F(@) = 0. 

The equation F (u) = 0 is a cubic in the variable uv and the positions of its roots can 
be deduced by considering the sign of F'. We have the following information: 


* Note that n = W + dcos@, not w. 
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FIGURE 19.7 The form of the cubic function 
F (u). The black squares are at its roots. 


(i) As u tends to positive (or negative) infinity, F (u) tends to positive (or negative) infinity. 
(ii) F(1) = —(L- — Cn)? and F(—1) = —(L- + Cn)? which are both negative." 


(iii) Given that 69, the initial value of 6, lies in the range 0 < 69 < z and that 6 is not zero 
initially, then F (cos 69) must be positive. 


The graph of F (uw) must therefore have the form shown in Figure 19.7. It follows that the 
equation F (uw) = O must have a root Umin lying in —1 < u < cos, a root Umax lying 
in cos @) < u < 1, and a third root lying in uv > 1, as indicated by the black squares in 
Figure 19.7; this accounts for all three roots. The root greater than unity is not physically 
admissible since it lies outside the range of u when u = cos@. It follows that, in the 
motion, u must oscillate in the range Umin < U < Umax. Hence, as in the case of the 
spherical pendulum, the inclination angle 6 must perform periodic oscillations between 
two extreme values a and 6. The difference here is that it is possible for both a and 6 
to be acute angles (measured from the upward vertical); in other words, the top can stand 
‘upright’. This oscillation of the inclination angle @ is called nutation of the top. 


Example 19.3 Finding the range of the inclination angle 


Suppose that the top is released with its axis at rest and making an angle of 2/3 with 
the upward vertical. Find the function F (uw). For the case in which C*n* = 4AM gh, 
find the range of values of @ that occur in the subsequent motion. 


Solution 
On using the initial conditions 6 = 2/3, 6 = 0, 6 = 0 whent = 0, we find that 
a 5c nand E = 5C ne 4 5M gh. The function F (uv) therefore reduces to 


F(u) = 41-2) [4AMeh(l — u2) — C2n? (1 — 2u) | 


and when C?n? = 4AM gh, this becomes 


F(u) = AMghu(i — 2u)(2 — u). 


* F(41) is zero when L; = +Cn respectively. This corresponds to a special (and not very important) class 
of motions in which the axis of the top passes through the upward or downward vertical. We exclude these 
cases from consideration. 
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FIGURE 19.8 Three different forms for the motion of the axis of symmetry of the top. The paths 
show the movement of the point in which the axis of symmetry intersects a fixed sphere with centre 


O. 


The physical roots of the equation F (uv) = 0 are u = O and u = 5 and so @ oscillates 
in the range 2/3 < 0 < 1/2. This is the motion shown in Figure 19.8 (centre). 


Question Finding the period of 0 
Find the period of the angle 9 in the same motion. 


Answer 


With the same set of initial conditions (and with C?n? = 4AM gh), the ODE for 6 
reduces to 


_ M ghcos@(1 — 2cos@)(2 — cos @) 


62 
Asin? 6 


On taking square roots and integrating, the period t of the function @(t) is found to 
be 


5(_4 a ie sin@ d6 gag A ue a 
— ——— XY. —— ‘ 
Mgh 1/3 [cos@(1 — 2cos@)(2 — cos 6)]!/2 Mgh 


Precession of the axis 


Once the function @(t) has been determined (at least in principle), the precession angle 
o(t) can be be found from the angular momentum conservation equation 


Ag sin? 6 + Cncos@ = L-. (19.31) 


Note that the precession rate ¢ is not constant when the top is in general motion. 
With the initial conditions 0 = a, 0 = 0, ¢ = 0, @ = Q, the value of the constant L, 
is AQ sin? a + Cncosa, and equation (19.31) becomes 


Ag sin? 6 = AQ sin? a + Cn(cos a — cos 6). (19.32) 
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Suppose that C?n? > 4AM gh so that steady precession can exist at any inclination. Then 
if the initial precession rate Q is set equal to Q% (the rate of slow steady precession at 
inclination @), the resulting motion of the top is (unsurprisingly) slow steady precession. 
However, if 2 < Q°, it can be shown (by differentiating equation (19.30) with respect to 
t) that the initial value of 6 is positive so that 6 initially increases and is the minimum 
value taken by 9. (In other words, the axis of the top falls initially.) In this case, we can 
see from equation (19.32) that 


AQ sin? a < Ad sin? 6 < AQ sin’ a + Cn(cos a — cos B), 


where f is the maximum value taken by @. It follows that 


(i) If Q > O, then the precession rate ¢ is always positive. This is the case shown in Figure 
19.8 (left). The critical case in which Q2 = 0 (when the axis is released from rest) is 
shown in Figure 19.8 (centre). 

(ii) There is no reason why the top axis cannot be projected the ‘wrong’ way. If Q < 0 
(but is not so negative that AQ sin? w + Crn(cos a — cos B) < 0), then d is sometimes 
positive and sometimes negative! In this case, the path of the axis crosses itself, as 
shown in Figure 19.8 (right). 

(iii) If the value of Q is so negative that AQ sin? a + Cn(cosa — cos B) < 0, then @ is 
always negative. However, this requires a ‘fast’ precession rate that is unlikely to be 
observed. 


In a similar way, one can show that when QS < Q < Q*, the axis of the top rises 
initially and that @ is always positive. The motion of the axis resembles that shown in 
Figure 19.8 (left). 


Example 19.4 The precession during a period of 6 


In the last example, find the angle through which the top precesses during one period 
of the inclination angle 6. 


Solution 


In the last example, the axis of the top is released from rest with @ = 2/3, and with 
C*n* = 4AMgh. With a = 1/3 and Q = 0, the equation for the precession rate d 
becomes 


Cn(1 — 2cos 6) 
~  QAsin26 | 


It follows that ®, the increase in # during one period of the coordinate 0 is given by 


= 6()-90)= | pdt 


Cn i L=2cos6 (mer of (! —Zeent) do 
- = |——_ x —_.—_] —. 
2A 0 sin2 0 A x/3 sin? 0 0 
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FIGURE 19.9 The gyrocompass is a gyroscope whose symmetry axis 
is constrained to remain in a horizontal plane. 


On using the expression for 6 from the last example, we obtain 


sd 1 —2cosé We dé 
o=2/ — © 1.69, 
1/3 Leos @(2 — cos@) sin 0 


which is about 97°. This motion is shown in Figure 19.8 (centre). 


19.7 THE GYROCOMPASS 


The gyrocompass is the most important navigational instrument used in the world 
today. Invented by Elmer Sperry* in 1908, it consists essentially of a gyroscope which 
is smoothly pivoted so that its centre of mass G remains fixed and its axis of symmetry 
remains in the horizontal plane through G; the gyroscope is however free to rotate about 
its axis of symmetry and about the vertical through G (see Figure 19.9). The constraint 
that the axis of the gyro should remain horizontal is vital to its operation. 


Non-rotating Earth 


Consider first the case in which the Earth has no rotation. Let Gx,x2x3 be a frame fixed to 
the Earth, with the axis Gx3 pointing vertically upwards and the axis of the gyro moving in 
the horizontal (x, x2)-plane. Let Gx}x4x3 be a second set of axes with Gx} pointing along 
the axis of symmetry of the gyro; these axes therefore rotate with the gyro. Since the Euler 
angle @ is constrained to be 2/2 in this problem, the system has two degrees of freedom and 
the Euler angles @ and wy (see Figure 19.9) form a set of generalised coordinates. In terms of 
these angles, the angular velocity of the gyroscope is 


w= we, +¢es, (19.33) 


* Elmer Ambrose Sperry (1860-1930), American inventor and entrepreneur. 
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where {e',, €, €3} are the unit vectors of the coordinate system Gx}x4.x3; note that e) and e/, 
are functions of the time. The corresponding angular momentum is therefore 


Lo =Cwhe + Ades, 


where {C, A, A} are principal moments of inertia of the gyro at G. Now consider the applied 
moment KG. Since the gyro is free to rotate about the axes Gx} and G13, it follows that the 
applied moment about these axes is zero. However there is a moment* applied about the axis 
Gx‘, which enforces the constraint that the axis of the gyro should remain in the (x1, x2)-plane. 
Hence the equation of motion of the gyro is 


oe : 
a (Cwe' + Ades) = Ke, (19.34) 


where K is the unknown ‘moment of constraint’. Now 
f 


d 5 : a . dé, ‘ 
a (Cv ei + Ades) = Cwe + Cu + Ades, 


and 


de, de’ , do _ 
dt do dt 


rae, 

Pep. 

The last step follows since the unit vectors e’,, e, are analogous to the polar unit vectors 7, 6 
and hence satisfy the relations 


“i / 
de _ de, _ 


ig = ote (19.35) 


On combining these results, we obtain the three component equations 


v=0, Cob=K, $=0. 
Hence the motion of the gyro has the form 
wan, $=, 


where n and Q2 are constants determined by the initial conditions. Thus the gyro has constant 
spin n and precesses at a constant rate 82. The ‘moment of constraint’ that keeps the gyro axis 
horizontal is K = Cn. This is the complete solution for the gyro on a non-rotating Earth. 


Rotating Earth 


A gyro that simply precesses at a constant rate does not seem much like a direction finding 
device, but everything changes when we introduce the effect of the Earth’s rotation. Suppose 
now that the axes Gx1x2.3 are fixed at some location on the Earth with G.x3 in the direction 
of the /ocal vertical; the axes are oriented so that Gx; points north and Gx2 points west. Then 
Gx 1x2x3 is no longer an inertial frame and our equations of motion must be modified. 


* This moment is applied by the gimbal ring that holds the gyro. 


19.7 The gyrocompass 543 


The angular velocity of the gyro relative to the frame Gx\x2x3 is still given by equation 
(19.33) but now the frame Gx1x2.x3 is not inertial but rotates with the Earth’s angular velocity 
Q* . This can be written in the form 


Q’ = OF sinwe; + Q* cosae; 


= QF sina cos ge, — QF sina sin @ e5 + QF cosae3 (19.36) 


where the angle a is the co-latitude of the location where the gyro is situated. Hence, by the 
addition theorem for angular velocities, the true angular velocity of the gyro (relative to an 
inertial frame) is 


o = (we, tbe) + 2" 
= (v + QF sina cos $) e- (2° sina sing) e+ (4 iOF cos) €3. 
It follows that the true angular momentum of the gyro is given by 
LEe=ac (v + Q sina cos ) e} —A (2 sina sing) e5 +A (4 + 25 cos a) e3. (19.37) 


For the same reasons as before, the applied moment has the form KG = K e’,, where K is the 
unknown moment of constraint. The equation of motion of the gyro is therefore 


dLg 

where Lg is given by equation (19.37). Now 
dL d/. ; de’ 
a = c (v rere ig sina cos @) ey + C (v OF sina cos @) ae 


d de’, - : d 
— A— (2° sina sing) e, — A (2 sin a sing) — 4 4 Age3 + A (4 a8 cos) ae 
dt dt dt 
The vectors {e}, e5, e3} are the unit vectors of the frame Gx}x4x3 and so their (true) rates of 
change are given by 


/ / 
de, dé, 


de3 
xe, — — 
dt dt 


dt 


ok f 


=w* xe), = @* Xe3, 


where w* is the angular velocity of the frame Gx} x/x3 relative to an inertial frame. By the 
addition theorem for angular velocities, this is given by 


o = e3 + a= 
= (2 sin @ COs ) e} — (2° sina sing) e5 + (4 + 2° cos a) e3. 


On combining these results and simplifying, we obtain the three component equations of 
motion for the gyro. As before, the second of these equations serves to determine K and 
the first and third are 


< (v + QF sina cos) = 0, 
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and 
Ad +2 sina [c (v Be ob sina cos @) — AQF sina cos a sing = 0. 
It follows that 
w+ Q° sinacos¢ =n, 


where n is a constant. Hence the spin @ - e’, of the gyro is a constant of the motion. The 
equation for the precession angle ¢@ then becomes 


Ab +QF sing [en — AQ sina cosa | sind = 0. (19.39) 


In practice, the ratio AQ” /Cn is very small so that Cn — AQ sina cosa can be closely 
approximated by Cn. 


Equation (19.39) is the equation for the (large) oscillations of a pendulum about the 
direction ¢ = 0, that is, about the axis Gx ;. Since this axis points north, it follows that 
the axis of the gyro performs periodic oscillations about the northerly direction. If these 
oscillations are damped (a feature not included in our model), then the axis of the gyro 
will eventually settle pointing north. The gyro does therefore ‘home’ to north and (despite 
statements in the literature to the contrary) does not have to be initially set pointing north. 


Example 19.5 The oscillaton period of the gyrocompass 


Estimate the period of small oscillations of the gyro in a typical case. 


Solution 
Ina typical case the ratio Q£ /n < 10~° so that the equation (19.39) is very accurately 
approximated by 


Ab + (cna® sin a) sind = 0, 
which, for small oscillations is approximated by 
Ad + (cnak sina) ¢=0. 


The period of these simple harmonic oscillations is 


a 1/2 
Tt=20 - ; 
G. =a) 


With Q” = 0.000073, n = 50 Hz, C = 2A and a = 45° this gives a period of about 
35 seconds. 


19.8 EULER’S EQUATIONS 


The simplest set of equations for the rotational motion of a rigid body are Euler’s 
equations. They apply to any body, symmetrical or not, but they come with some defi- 
ciencies. 
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Lo 
Ww 
FIGURE 19.10 The principal axes G.xjx2.x3 
are embedded in the body and have the same 
angular velocity as the body. Ly 


Consider a general rigid body with centre of mass G and principal axes Gx1x2Xx3, as 
shown in Figure 19.10. If the body has an axis of dynamical symmetry through G, then 
there are infinitely many sets of principal axes at G and it is possible to find principal 
axes that are not rigidly attached to the body; indeed, this has been an essential feature 
when calculating the angular momentum Lg for a symmetrical body. However, we now 
wish to take a set of principal axes that is rigidly fixed within the body, even when this is 
not necessary. To emphasise the point, we will call these a set of embedded axes at G. 
The most important property of a reference frame embedded in a body is that its angular 
velocity is the same as the angular velocity of the body in which it is embedded. 

Let w, the angular velocity of the body, be expanded in the form 


®=o1e| +4a2€2+ 0363, (19.40) 


where {e1, €2, e3} are the unit vectors of the coordinate system Gx1x2x3. Then the corre- 
sponding angular momentum is 


Lg = Aa e; + Bw2e2 + C363, (19.41) 


where {A, B, C} are the principal moments of inertia of the body. If the body is acted 
upon by the external moment KG, then the equation of rotational motion about G is 


d 
7 (Awier + Borer + Coes) = Kg. (19.42) 


Now 


d . . ; 
i (Aq e] + Baz er + Cw3e3) = Aw) e| + Baze2 +Caw3e3 + 


d d d 
BOI 5 AO ig AO 


A 
“ dt dt dt 


and, since the unit vectors {e1, e2, e3} are embedded in the body, it follows that 


de, de2 de3 
— = WX], — = @Xe@é), — = WXeé3, 
dt dt dt 
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where @ is given by equation (19.40). On combining these results and simplifying, we 
obtain the three component equations 


Euler’s equations 


Aa, —(B —C)ara3 = K 
ie 2 03 1 (19.43) 
Baz —(C — A)a3a1 = Ko 


Ca3—(A— B)a,a2 = K3 


where {@1, @2, w3} and {K,, K2, K3} are the components of the angular velocity @ and 
the applied moment K g in the embedded coordinate system Gx,x2x3. These are Euler’s 
equations of motion. 

As an example, we re-consider a problem solved earlier: the free motion of a body 
with axial symmetry. 


Example 19.6 Body with axial symmetry 


Solve Euler’s equations for the free motion of a body with axial symmetry. 


Solution 


Let Gx3 be the axis of symmetry so that B = A. In free motion, K} = K2 = K3 = 0 
and the Euler equations reduce to 


AQ, = (A —C)a203 = 0, 
Aa@2 — (C — A)@3a1 = 0, 
Ca3=0. 


The third equation gives 
03 =N, 


where 7 is a constant that we recognise as the spin w - e3. The first two equations then 
reduce to 


@, — Qa, = 0, a2 + Qa, = 0, 


where Q = (A —C)n/A. On eliminating w2, we find that @ satisfies the SHM 
equation 


6, + Qa = 0, 
which has general solution 


@, = Psin(Qt+y), 
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FIGURE 19.11 Relative to the embedded 
frame, @® and Lg precess around the 
symmetry axis. 


where P and y are constants of integration; the corresponding solution for 2 is 
w2 = P cos(Qt + y). 
The general solution of the Euler equations in this case is therefore 


@, = Psin(Qt+y), 
a2 = Pcos(Qt+y), 


03=Nn, 


where P and y are constants. The corresponding components of the angular momen- 
tum vector LG are 


L,=APsin(Qt+y), 
Ly = AP cos (Qt+y), 
L3=Cn. 


The geometrical interpretation of these results is that the vectors e3, w and Lg all 
lie in the same plane, where they make consant angles wth each other, and that this 
plane rotates around the embedded axis Gx3 at the constant rate (A — C)n/A. This 
situation is shown in Figure 19.11 for the case in which A > C. 


Motion of the body 

What we have found is the time variation of @ and L as seen from the principal 
embedded frame Gx,x2x3. The true picture (relative to an inertial frame) is quite 
different. In this frame, Lg is constant and the time variation of w and the motion of 
the body have yet to be determined. In general, deducing the motion of the body is 
more difficult than solving the Euler equations. In the present case though, this can 
can be done fairly easily. From the above expressions for w and Lg, it follows that 
can be written in the form 
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GC 
——])ne 
A 3 


A-C 
Eg t+ “AC L363 


A- 
Lg+ (==) (|LgG | cosa) e3, 


where qa is the constant angle between Lg and the axis Gx3. If we now write L = 
|Lg| and k = Lg/L, then 


L A-C 
=—k+L —— , 
@ A + cosa ( A Je 


which is precisely the solution found in section 19.4. m 


Deficiencies of Euler’s equations 


The above example illustrates the first deficiency in Euler’s equations: 


The solutions of Euler’s equations yield the time variation of @ as seen from the embedded 
reference frame. The position of the body is still unknown. 


Thus Euler’s equations do not tell us where the body is at time t, which is the most 
important unknown in the problem. In the example above, we were able to deduce the 
true motion of the body fairly simply. However, for an unsymmetrical body (which is 
the problem of principal interest that remains), there is no simple method and a further 
substantial calculation is required. 

A second (but related) deficiency appears when the applied moment K G is not zero: 


When the applied moment K g is not zero, the components K1, K2, K3 will not generally 
be known since the orientation of the body is unknown. In such a case, one cannot even 
begin the solution. 


For example, in the problem of the spining top, the moment of the gravity force is 
known, but the components of this moment in the embedded reference frame are not 
known. The right sides of Euler’s equations for the spinning top are therefore unknowns. 


Dynamical balancing 


The difficulties mentioned above disappear when the motion of the body is given, and we 
wish to find the moments that are being applied to the body to permit that motion. In such 
a case, Euler’s equations give the answer immediately. Consider the case of an axially 
symmetric body which is smoothly constrained to rotate about a fixed axis through G but 
is inaccurately mounted so that the rotation axis is not quite coincident with the symmetry 
axis, as shown in Figure 19.12. What effect does this mis-alignment have on the pivots? 
Let the symmetry axis be Gx3 and take the embedded axis Gx2 to be perpendicular 
to the rotation axis P Q; the direction of the embedded axis Gx is then determined (see 
Figure 19.12). If the body is rotating about PQ at a constant rate A, then the angular 
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FIGURE 19.12 The axially symmetric body is constrained 
to rotate about the fixed axis PQ which makes an angle f 
with the axis of symmetry. 


velocity components in the principal embedded frame Gx}x2x3 are 


@,; = —AsinB 
a2 = 0 


w3 = Acos B. 


To find the moment exerted by the pivots, we simply substitute these expressions into the 
left sides of Euler’s equations. This gives 


K, =0, Ky = (A—C)A’ sin B cos B, K3 =0. 


Thus the applied moment is constant in the embedded frame. However, it is not constant in 
an inertial frame. The axis G-x2 is perpendicular to P Q and rotates around it at a constant 
rate 2. The pivots are therefore subjected to oscillating forces with angular frequency x 
and magnitude proportional to 27. In such a configuration, the body is said to be dynam- 
ically unbalanced. Since G lies on the rotation axis, the body can rest in equilibrium in 
any position, but, when rotating, it exerts oscillating forces on the pivots that may result 
in vibration of the mounting. This is why the wheels of motor cars must be re-balanced* 
when a new tyre is fitted. For a turbine with a mass of many tonnes rotating at 50 Hz say, 
extremely accurate alignment is needed, for otherwise the forces needed to restrain the 
turbine would be so large that it would break loose its mountings! 


19.9 FREE MOTION OF AN UNSYMMETRICAL BODY 


The free motion of unsymmetrical bodies is an important topic in space technol- 
ogy because of its application to the tumbling motion of spacecraft and space satellites. 


* This is done by adding lead weights at determined positions around the rim of the wheel. 
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FIGURE 19.13 The unsymmetrical body 
moves under no forces with G at rest. 


Such motions are considerably more difficult to analyse than for bodies with axial sym- 
metry. The reader may be surprised then to learn that there is an exact analytical solution 
to the problem of an unsymmetrical body moving under no forces (see Landau & Lifshitz 
[6], Chapter VI). However, since this solution is expressed in terms of an impressive array 
of elliptic functions and theta functions, it gives one almost no idea of what the motion is 
actually like! Instead, we will use Euler’s equations combined with geometrical argu- 
ments. This enables us to make interesting predictions about the motion without too much 
analysis. 


Rotation of a free body about a fixed axis 


Suppose a free unsymmetrical body 6 (an asteroid, for example) is observed from an 
inertial frame in which its centre of mass G is at rest. Is it possible for the body to rotate 
about a fixed* axis? 


This is one of the few questions about unsymmetrical bodies that can be answered 
quite easily. If such an axis does exist, then it must pass through the fixed point G. Let 
Gx1X2Xx3 be a set of principal axes of 6 at G. Since B is unsymmetrical, these principal 
axes are unique (apart from labelling) and are therefore necessarily embedded in B; Euler’s 
equations therefore apply. 

The kinetic energy of B is given by T = 1|e|?, where J is the moment of inertia of B 
about the rotation axis and @ is its angular velocity. Since B is rotating about a fixed axis, 
I is aconstant. Also, since the motion takes place under no forces, the kinetic energy T is 
constant. It follows that the angular speed |@| must be constant and that the components 
{@1, 2, ©3} Of @ in the embedded principal frame Gx,x.2x3 must be constants. Hence 
@| = @2 = w3 = 0 and Euler’s equations then imply that 


(B — C)a203 = (C — A)a3a@1 = (A — B)aja2 = 0, 


* This means an axis fixed in space and fixed in the body. 
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where {A, B, C} are the principal moments of inertia about the axes Gx1, Gx2, Gx3. Since 
B is an unsymmetrical body, A, B and C are all different and so 


0203 = 0301 = W102 = 0. 


The only possible motions are therefore 


(i) wy, =constant, @2 = 03 = 0, 
(ii) @2 = constant, 03 = w, = 0,7 
(iil) w3 = constant, @| =a. =0. 


The first solution corresponds to the body rotating with constant angular speed about the 
principal axis Gx,. The other solutions correspond to rotations about the principal axes 
Gx2 and Gx3. Hence: 


Free unsymmetrical body rotating about a fixed axis 


A free unsymmetrical body can only rotate about a fixed axis if this axis is one of the 
three principal axes of the body at G. The rotation will then have constant angular 
speed. 


For example, if it is required that a spacecraft should rotate about a fixed axis, this axis 
must be one of the three principal axes through its centre of mass. Later we will examine 
the stability of each of these steady motions. 


General motion of a free unsymmetrical body 


Once again we will suppose that the motion is viewed from an inertial frame in which G is 
at rest. It will be convenient to express the Euler equations in terms of the components of 
L in the frame Gx1x2x3 (instead of the components of w). Since L; = Aa, L2 = Ban 
and L3 = C3, this gives the equivalent system of equations 


BCL, = (B —C)L2L3, 
AC Lp = (C — A)L3L1, (19.44) 
ABL3 = (A—B)L,L». 


If we multiply the first equation by AL, the second by BL2, the third by CL3 and add, 
we obtain 


ABC (LiL, + LoL2+L3L3) =0, 
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FIGURE 19.14 The L-sphere and the curves 
in which it meets different T -ellipsoids for 
the case A < B <C. 


which integrates to give 
(Pisa te Sa, (19.45) 


where L is a constant. Similarly, if we multiply the first equation by L1, the second by 
Lz, the third by L3 and add, we obtain 


BC LiL, + AC LoL + ABL3L3 =0, 
which integrates to give 


2 2 2 
cere 


=2T 19.4 
A B Cc , oo 


where T is a constant. The first of these integrals means that the magnitude of Lg is 
conserved” and the second means that the kinetic energy of the body is conserved. 

The equations (19.45), (19.46) place geometric restrictions on the time variation of 
Lg, as seen from the embedded frame. Equation (19.45) means that, if Lg is considered 
to be the position vector of a ‘point’ Z in (L1, L2, L3)-space, then that point must move 
on the surface of a sphere with centre O and radius L, which we will call L-sphere. In the 
same way, equation (19.46) means that the L-point must also move on the surface of the 
ellipsoid’ in (L1, Lz, L3)-space defined by equation (19.46). We will call this ellipsoid, 


* Note that, in the embedded frame, the individual components L1, Ly, L3 are not conserved, but the 
1/2 
magnitude G + ig + 13) is. 
* An ellipsoid is the three-dimensional counterpart of an ellipse. The standard equation of an ellipsoid with 
semi-axes a, b,c is 
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which has the semi-axes (2AT)!/2, (2BT)!/*, (2CT)!/2, the T-ellipsoid. It follows that 
the path of the L-point must be a curve of intersection of the L-sphere and the T -ellipsoid. 
Figure 19.14 shows the L-sphere for a fixed value of the parameter L. The paths on its 
surface are examples of the curves in which it is cut by the T-ellipsoid for different values 
of the parameter T. 


Paths of the L-point 


Suppose that A < B < C and that L is fixed. For each value of L, values of T that are 
too small or too large are not admissible. Indeed we can see from equation (19.46) that 


If T lies outside this range, then the L-sphere and the T-ellipsoid do not intersect. 


(i) When T = Typin the intersection consists of the two isolated points (0,0, +L). The 
constant solution L = (0,0, L) corresponds to steady rotation in the positive sense 
about the principal axis Gx3. The solution L = (0,0, —L) corresponds to a steady 
rotation in the opposite sense. 

(ii) Suppose now that T is slightly increased. The intersection point (0, 0, L) then becomes 
a small closed curve* on the surface of the L-sphere that encircles the point (0, 0, L) 
as shown in Figure 19.14. The L-point must therefore move along this curve. The 
direction of motion of the L-point can be deduced by examining the signs of Ly and 
Ly given by the first and second equations of (19.44). As T increases, the intersection 
curve becomes larger (and changes its shape) but it still encircles the point (0, 0, L). 
This pattern continues until T = L?/2B. 

(iii) When T = L?/2B, the L-sphere and the T -ellipsoid touch at the points (0, +L, 0) and 
the intersection curves are the dashed lines in Figure 19.14. This is a transitional case 
between two different regimes of curves. It includes the case of steady rotation about 
the principal axis Gx2. 

(iv) When T > L?/2B, the intersection curves are completely different. Now they encircle 
the point (L, 0, 0) (or (—L, 0, 0)) and, as T approaches Tax they shrink to zero around 
these points. The case T = Tax corresponds to steady rotation about the principal axis 
Gx. 


Stabilty of steady rotation about a principal axis 


We can make some interesting deductions from the form of the above paths. 


(i) In the general motion, the L-point moves around a closed curve. It follows that the 
time variation of L, as seen from the embedded frame, is periodic, a property not 
apparent from equations (19.44). This does not however mean that the motion of the 
body is periodic. 

(ii) If the body is in steady rotation in the positive sense about the axis Gx; and is slightly 
disturbed, then the path of L remains close to the point (L, 0,0). This implies that, 


* The projection of this curve on to the (+1, x2)-plane is a small ellipse with centre the origin. 
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in an inertial frame where L is constant, the axis Gx, remains close to L. In other 
words, the original steady rotation about the axis Gx, is stable to small disturbances. 
The same applies to a steady rotation about the axis G.x3. However, if a steady rotation 
about the axis Gx2 is slightly disturbed, the paths of the L-point lead far away from 
the original constant positions at (0, +L, 0). This implies that a steady rotation about 
the axis Gx2 is unstable to small disturbances. Hence: 


The steady rotation of an unsymmetrical body about a principal axis is stable for the 
axes with the greatest and least moments of inertia, but unstable for the other axis. 


Thus, it is useless to try to stabilise a satellite in steady rotation about the principal axis 
with the ‘middle’ moment of inertia. 

(iii) The stability argument above is modified if the body has a means of energy dissipation. 
This cannot happen with rigid bodies, but real bodies, such as a satellite, can flex and 
slowly dissipate mechanical energy as heat. In this case, steady rotation of the body 
about the axis with the /east moment of inertia (which corresponds to T = Tmax) 
is no longer stable. The ‘radius’ of the path of LZ around the axis Gx, will slowly 
increase until T = L?/(2B) when it will switch to a large path encircling one of the 
points (+0, 0, +L). This path will then gradually shrink to zero. Thus, in the presence 
of energy dissipation, the body will end up rotating about the axis with the largest 
moment of inertia. In the early days of space exploration* this fact was learned by hard 
experience! 


Motion of the body 


Suppose that Euler’s equations have been solved for the unknowns @, w2, w3. What is 
the corresponding motion of the body? Let {e1, e2, e3} be the unit vectors of the principal 
embedded frame G.x1x2x3. Then the time variation of these vectors, as seen from an 
inertial frame, is given by 


€] = Xe], 62 = Xe. €3 = WX e3. 


Unfortunately, these equations are not really uncoupled since Euler’s equations give us @ 
in the form 


@= 01, e€| + 02€2+ 0363, 


which also involves the unknown vectors {e;, e2, e3}. On combining these equations, we 
get the set of coupled ODEs 


€) = @3e2 — 73, 
€o = W1 €3 — 03], (19.47) 


€3 = @2e| — a 2, 


* Kaplan [5] recounts the following incident: The first U.S. satellite, Explorer I, was set into orbit rotating 
about its longitudinal axis, which was the principal axis with the /east moment of inertia. After only a 
few hours, radio signals indicated that a tumbling motion had developed and was increasing in amplitude 
in an unstable manner. It was concluded that the satellite’s flexible antennae were dissipating energy and 
causing a transfer of body spin from the axis of minimum inertia to a transverse axis of maximum inertia. 
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for the unknown vectors {e1, e2, e3}. These equations cannot usually be solved explicitly, 
but they are suitable for numerical integration (see Problem 19.17) and for approximate 
solution by perturbation analysis. This is illustrated by the following example. 


Example 19.7 Motion of a principal axis in space 


An unsymmetrical body is in steady rotation about the principal axis with the largest 
moment of inertia. Find an approximation to the wobble of this principal axis if the 
body is slightly disturbed. 


Solution 


Suppose that A < B < C and that, in the initial motion, w; = @2 = 0 and @3 = A, 
where A is a constant. In the disturbed motion we suppose that w1, w2 (and their time 
derivatives) remain small. If we now apply Euler’s equations and then linearise, we 
obtain 


Aa, — (B —C)Aa = 0,7 
Baz — (C — A)Aa = 0,7 
Ca3 = 0, 


Hence, in the linear approximation to the disturbed motion, the coupled equations for 
@1, @2 are easily solved to give 


wo = €A ((C — B)/A)!? cos(Qt + y), 


a, =€eA((C — A)/B)!/? sin(Qt + y), (19.48) 


where €, y are dimensionless constants and 


_ AVC —By\? 
Sk (A | . 
AB 


The corresponding expressions for L;, L2 are 


L, =€A(A(C — B))!/? cos(Qr + y), 


Ly =€A(B(C — A))'/? sin(Qt + y). (19.49) 


Hence the projection of the path of the L-point on to the (x1, x2)-plane is an ellipse 
executed in the anti-clockwise sense. This is entirely consistent with the paths shown 
in Figure 19.14 and our discussion of stability. 

Now to find the actual time variaton of the unit vector e3. In the initial motion 
we have 


e, = cos Ati+sinAt j, 
e2 = —sinAti+cos At j, 
e,=k, 


where {i, j, k} is a fixed orthonormal set. Since w,, w2 are small in the disturbed 
motion, the vectors e; and e2 that appear in the third equation of (19.47) can be 
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FIGURE 19.15 The wheel rolls on a rough horizonal plane. The dashed curve is the path of the 
point of contact C. 


replacd by their steady (zero order) approximations to give 
€3 = w2 (cos Ati + sin At J) — @ (— sin Ati + cos At j) 


where w, and w2 are given by the expressions (19.48). This equation can now be 
integrated explicitly to give the first order approximation to the wobble of the axis 
vector é3. 

We will not actually do this integration because the result is messy and the details 
of the formula are less interesting than its structure. By examining the form of the 
right side of the above equation one can see that e3 must have the form e3 = k + 
(small terms in the i and j directions ), where these small terms contain oscillating 
functions with two different frequencies Q + A and Q — A. In general the ratio 
of these frequencies will not be a rational number and so the motion of the axis 
vector e3 is not periodic. In fact, the projection of e3 onto the fixed plane through 
G perpendicular to k describes a Lissajous type path bounded by two ellipses. (See 
however Problem 19.14.) 


19.10 THE ROLLING WHEEL 


Suppose a wheel is free to ro// on a rough horizontal floor. How does it move? 

This, our final problem, typifies problem solving in classical mechanics. Nothing could 

be easier to describe, but few could just sit down and write out the solution to this classic 
problem. 

Since the rolling condition is a non-holonomic constraint, the rolling wheel cannot 

be solved by the Lagrangian methods presented in this book.* We therefore use vectorial 

mechanics. Figure 19.15 (left) shows a thin circular disk’ of mass M and radius b rolling 


* For an extension of the Lagrangian method to non-holonomic systems, see Goldstein [4], chapter 2. 
* The disk need not be uniform, but it must have axial symmetry about its centre. For example, it could be 
a hoop. 
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on the (x, y)-plane. The thick dashed curve is the path of the point of contact C. The 

vector a (= a(t)) is the axial unit vector and the vector n (= n(t)) is the unit vector in 
— 

the direction CG. Since the disk has axisymmetry, the angular velocity of the disk can be 

expressed in the form 


@=axat+da, (19.50) 


where the spin A (= @ - a) is an unknown scalar function of the time. The corresponding 
angular momentum Lg is then given by 


Lg = Aaxa+Chia, (19.51) 


where {A, A, C} are the principal moments of inertia of the disk at G. 
The disk moves under the forces shown in Figure 19.15 (right). It follows that the 
equation of translational motion of the disk is 
dV 


M—=X-—Mcek, 19.52 
Ai 8 ( ) 


where V is the velocity of G, and that the equation of rotational motion is 


d : 

77 (Aaxa + Cha) = (—bn)x X. (19.53) 
In addition, we have the rolling condition that the particle of the disk in contact with the 
floor has zero velocity, that is 


V +@x(—bn) = 0. (19.54) 


Equations (19.52)—(19.54) are the governing equations of the motion. On eliminating X 
and V, we obtain the equation 


d : ‘ ; 3 

7 (Aaxa+Cha) + Mb? [o-(@-n)n—(w-n)n]+Mgbnxk=0. (19.55) 
where @ is given by equation (19.50). The vectors a and n are not independent. In fact n 
can be expressed in terms of a and k in the form 


k—(k-a)a 


Thus it is possible in principle to eliminate n from equation (19.55) and obtain a vector 
equation solely in terms of the unknown axial vector a and the spin 4. This equation is 
sufficient to determine a and i. 

All this begins to look easier when we take components of the vector equation (19.55) 
in three well chosen directions, namely, the (instantaneous) principal directions a, n and 


558 Chapter 19 Problems in rigid body dynamics 


axn. On taking the scalar product of equation (19.55) with each of these vectors, we 
obtain the scalar equations 


(C + Mb?)i — Mb? [a,a,n](a-a) = 
Ala,a,n|+Ca(a-n) 

(A + Mb’) (@+n) —(C + Mb’) [a,a,n]— 
Mb? [a,a,n] [a,n, n| — Megb(a-k)=0. 


’ 


; (19.57) 


0 
0 


Here [ p, g, r] means the triple scalar product of the vectors p, qg and r. These equations 
look better still when expressed in terms of scalar variables. Let 6 be the angle between 
the plane of the disk and the vertical, and let ¢ be the angle between the tangent line to the 
disk at C and the axis Ox, as shown in Figure 19.15 (left). Then a and n take the form 


a = (cos@ sin @)i — (cos@ cos ¢) j + (sind) k, (19.58) 
n = —(sin6@ sin @)i + (sin@ cos d) j + (cos@)k, (19.59) 


where {i, j, k} are the constant unit vectors of the inertial frame Ox yz. In terms of 6 and 
@, the equations (19.57) become 


(C + Mb*)i + Mb’ cos 06¢ = 0, 
A (cos6¢—2sin06¢)+Crd=0, — (19.60) 
(A + Mb?)6 + Acos6 sin@ ¢”? — (C + Mb”)acos0¢ — Mgbsiné = 0. 


This is our final form of the governing equations for the rolling disk. Note that, unlike 
other problems involving an axially symmetric body, the spin A is not a constant of the 
motion, but must be solved for, along with the angular coordinates 0, 6. The complexity of 
these coupled non-linear equations is such that explicit solutions are rare. However, they 
are suitable for numerical integration and (if they can be linearised) for finding approxi- 
mate solutions. This is illustrated by the next example. 


Example 19.8 Stability of straight line rolling 
How fast must a wheel roll in a straight line so that it is stable to small disturbances? 


Solution 


In the unperturbed motion (along the x-axis say), we have 0 = 0,@ = OandA = A, 
where A is a positive constant. In the perturbed motion, we suppose that 6, ¢ and 
A. — A (and their time derivatives) remain small so that we may linearise equations 
(19.60). This gives 


A=0, 
Ad+CA6 =0, 
A(A + Mb’)6 — (C + Mb?)A¢ — Mgbo = 0. 
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Hence, in this linear approximation, 4 = A, a constant. The second equation gives 
Ago +CA@ = constant, 


and, in order to be satisfied by the unperturbed state, the constant must be zero. On 
substituting this into the third equation, we obtain 


A(A + Mb?)6 +| A°7C(C + Mb?) — AMgb | 6 = fj (19.61) 


If the expression in the square brackets is positive, this is the SHM equation and 6 
will perform small oscillations about 6 = 0; the same is then true of @. Hence, the 
condition for stability of straight line rolling is 


A?C(C + Mb’) — AMgb > 0, 
that is 


2, _ AMgb* _ 
C(C + Mb?)’ 
where V is the speed of the disk. For a hoop with A = 4Mb* andC = MD’, this 
condition becomes V? > gb/4. 


Path of the point of contact 


If r is the position vector of C, the point of contact, then 
r= R-—bn, 


where R is the position vector of G. It follows that 


dr _ V —b; 
dt me 
where V is the velocity of G. On using the rolling condition (19.54) and the expression 


(19.50) for , the differential equation for r becomes 


dr : : 

GF = OL @x@xn+rhaxn—n), (19.62) 
This is the path equation for the point of contact. Once the equations (19.60) have been 
solved for A, 6 and @, the right side of (19.62) is known and the path of C can then be 
found by integration. Numerical integration shows that some possible paths can be quite 
exotic, as shown in Figure 19.16. 


Stability of the bicycle 


Our analysis of the rolling wheel gives some idea of the complexity of the mechanics of 
the bicycle. Indeed, the question of stability is not completely settled and rival sets of 
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FIGURE 19.16 Two not-so-typical paths for the point of contact of a rolling wheel. 


equations appear in the literature. Probably the most important paper on bicycle physics is 
that by Jones.* Jones constructed a non-standard bicycle in which the angular momentum 
of the wheels (the ‘gyroscopic effect’) was cancelled out by fitting extra wheels that spun 
in the opposite direction. It turned out that this bicycle was just as stable as a normal 
one and could be ridden ‘no hands’. Despite Jones’s findings, the myth that a bicycle is 
stabilised by the the gyroscopic effect of the spinning wheels still persists. However, the 
concensus of informed opinion is that the stability of a bicycle depends crucially on the 
geometry of the forks holding the front wheel. Jones constructed another non-standard 
bicycle in which the front wheel was four inches ahead of its normal position; it was 
almost unridable! The website http://ruina.tam.cornell.edu/research/index.htm 
has references to further interesting work on bicycle physics. 


Problems on Chapter 19 


Answers and comments are at the end of the book. 


Harder problems carry a star (). 


Rolling balls 


19.1 Ball rolling on a slope A uniform ball can roll or skid on a rough plane inclined at an 
angle £ to the horizontal. Show that, in any motion of the ball, the component of @ perpendic- 
ular to the plane is conserved. If the ball ro//s on the plane, show that the path of the ball must 
be a parabola. 


19.2 Ball rolling on a rotating turntable * A rough horizontal turntable is made to rotate 
about a fixed vertical axis through its centre O with constant angular velocity Qk, where the 
unit vector k points vertically upwards. A uniform ball of radius a can roll or skid on the 
turntable. Show that, in any motion of the ball, the vertical spin @ - k is conserved. If the ball 
rolls on the turntable, show that 


V =2QkxV, 


* Jones, D.E.H., The stability of the bicycle, Physics Today 23 (1970) pp. 34-40. 
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where V is the velocity of the centre of the ball viewed from a fixed reference frame. Deduce 
the amazing result that the path of the rolling ball must be a circle. 

Suppose the ball is held at rest (relative to the turntable), with its centre a distance b from 
the axis {O, k}, and is then released. Given that the ball rolls, find the radius and the centre of 
the circular path on which it moves. 


19.3 Ball rolling on a fixed sphere* A uniform ball with radius a and centre C rolls on the 
rough outer surface of a fixed sphere of radius b and centre O. Show that the radial spin @ - c 


aad 
is conserved, where c (= c(t)) is the unit vector in the radial direction OC. [Take care!] Show 
also that c satisfies the equation 


T(a+b)cexé+2anc+5gexk=0, 


where n is the constant value of  - c and k is the unit vector pointing vertically upwards. 

By comparing this equation with that for the spinning top, deduce the amazing result that 
the ball can roll on the spherical surface without ever falling off. Find the minimum value of 
n such that the ball is stable at the highest point of the sphere. 


Axisymmetric bodies 


19.4 Investigate the steady precession of a top for the case in which the axis of the top moves 
in the horizontal plane through O. Show that for any n ¥ 0 there is just one rate of steady 
precession and find its value. 


19.5 The sleeping top By performing a perturbation analysis, show that a top will be stable 
in the vertically upright position if 


C2n? > 4AM gh, 


in the standard notation. [You can do this by either the vectorial or Lagrangian method. ] 


19.6 Estimate how large the spin n of a pencil would have to be for it to be stable in the 
vertically upright position, spinning on its point. [Take the pencil to be a uniform cylinder 15 
cm long and 7 mm in diameter. ] 


19.7 A juggler is balancing a spinning ball of diameter 20 cm on the end of his finger. Esti- 
mate the spin required for stability (i) for a uniform solid ball, (11) for a uniform thin hollow 
ball. Which do you suppose the juggler uses? 


19.8 Solve the problem of the free motion of an axisymmetric body by the Lagrangian 
method. Compare your results with those in Section 19.4. [Surprisingly awkward. Take 
the axis Gz of coordinates Gx yz to point in the direction of Lg and make use of the Lagrange 
equations for the Euler angles ¢@ and w.] 


19.9 Frisbee with resistance A (wobbling) frisbee moving through air is subject to a fric- 
tional couple equal to K w. Find the time variation of the axial spin A (= @ + a), where a is the 
axial unit vector. Show also that a satisfies the equation 


Aaxa+Kaxa+Cia=0. 
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* By taking the cross product of this equation with a, find the time variation of | a|. Deduce 
that the angle between w and a decreases with time if C > A (which it is for a normal frisbee). 
Thus, in the presence of linear resistance, the wobble dies away. 


19.10 Spinning hoop on a smooth floor A uniform circular hoop of radius a rolls and slides 
on a perfectly smooth horizontal floor. Find its Lagrangian in terms of the Euler angles, and 
determine which of the generalised momenta are conserved. [Suppose that G has no horizontal 
motion. ] 

Investigate the existence of motions in which the angle between the hoop and the floor is 
a constant a. Show that , the rate of steady precession, must satisfy the equation 


cosa Q? — 2nQ ~92 cota = 0, 
a 


where n is the constant axial spin. Deduce that, for n 4 0, there are two possible rates of 
precession, a faster one going the ‘same way’ as n, and a slower one in the opposite direction. 
[These are interesting motions but one would need a very smooth floor to observe them.] 


Euler’s equations 


19.11 Bicycle wheel A bicycle wheel (a hoop) of mass M and radius a is fitted with a smooth 
spindle lying along its symmetry axis. The wheel is spun with the spindle horizontal, and the 
spindle is then made to turn with angular speed Q about a fixed vertical axis through the centre 
of the wheel. Show that n, the axial spin of the wheel, remains constant and find the moment 
that must be applied to the spindle to produce this motion. 


19.12 Stability of steady rotation An unsymmetrical body is in steady rotation about a prin- 
cipal axis through G. By performing a perturbation analysis, investigate the stability of this 
motion for each of the three principal axes. 


19.13 Frisbee with resistance Re-solve the problem of the frisbee with resistance (Problem 
19.9) by using Euler’s equations. [Find the time dependencies of w3 and Ont + Or 


19.14 Wobble on spinning lamina An unsymmetrical lamina is in steady rotation about the 
axis through G perpendicular to its plane. Find an approximation to the wobble of this axis if 
the body is slightly disturbed. [This is a repeat of Example 19.7 for the special case in which 
the body is an unsymmetrical /amina; in this case C = A+B and there is much simplification. ] 


19.15 * Euler theory for the unsymmetrical lamina An unsymmetrical lamina has principal 
axes Gx1x2x3 at G with the corresponding moments of inertia {A, B, A + B}. Initially the 
lamina is rotating with angular velocity Q about an axis through G that lies in the (x1, x2)- 
plane and makes an acute angle w with Gx ;. By using Euler’s equations, show that, in the 
subsequent motion, 


2 2 2 


(B — A)oZ + (B+ A)oy = (B— A)Q’ sin’ a. 
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Interpret these results in terms of the motion of the ‘@-point’ moving in (@1, w2, w3)-space 
and deduce that @ is periodic when viewed from the embedded frame. 

Find an ODE satisfied by w2 alone and deduce that the lamina will once again be rotating 
about the same axis after a time 


4 i dé 
Q\B+A 0 (1—sin? a sin? 9)!/2° 


Computer assisted problems 


19.16 Solve Lagrange’s equations for the top numerically and obtain the motions of the axis 
shown in Figure 19.8. Use Euler’s angles as coordinates and use computer assistance to obtain 
the equations as well as solve them. [It seems easier not to use the conservation relations for 
P@ and py, in their integrated form, since this requires the initial conditons to be incorporated 
into the equations of motion.] 


19.17 Obtain the paths of the L-point for an unsymmetrical body, as shown in Figure 19.14. 
These were obtained by solving Euler’s equations, together with the equations 


€) = w3e2 — 0263, €2 = w1€3 — W3e1, €3 = wre] — w1€2, 


numerically. [Mathematica handled the solution of these twelve simultaneous equations with 
ease! ] 


Appendix 


Centres of mass 
and moments of inertia 


A.1| CENTRE OF MASS 


Suppose we have a system S of particles P,, P2,..., Py with masses m1, m2, 
..., My, and position vectors r1, r2,..., “Nn respectively. Then the centre of mass of S is 
defined as follows: 


Definition A.1 Centre of mass The centre of mass of S is the point of space whose 
position vector R is defined by 


pm mini a vier (A.1) 


aan mij M 


where M is the total mass of S. 


In Chapter 2, we gave some simple examples of centre of mass, but here we will 
suppose the system is a rigid body 6. Then G, the centre of mass of B, moves as if it 
were a particle of the body and its position can be calculated once and for all. 


Finding centres of mass by symmetry 


For most bodies, the position of G can be found by symmetry arguments so that no sum- 
mations (or integrations) are needed. Instead, one simply applies the following rules which 
follow fron the definition (A.1) : 


Rule 1: /f the body has reflective symmetry in a plane, then the centre of mass must lie 
in this plane. 


Rule 2: Jf the body has any rotational symmetry* about an axis, then the centre of mass 
must lie on this axis. 


* For a body to have a rotational symmetry, the mass distribution need not be preserved for all rotation 
angles about the axis. Just one angle (less than 277) is enough. 
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FIGURE A.1_ A semi-circular lamina of radius L 
a occupies the region A shown. —a a 


Suppose for example that the body 6 is a uniform circular cylinder occupying the 
region x? 4+ y? <a?,—b <z <b. Then B has reflective symmetry in the plane z = 0 and 
so G lies in this plane. B also has full rotational symmetry about the z-axis and so G must 
lie on this axis. It follows that G must be at the origin. Similarly, if 6 is a uniform lamina 
in the shape of an equilateral triangle, 6 has rotational symmetry about each of its three 
medians. G must therefore lie at the intersection of the medians. 

Not all bodies have enough symmetry to find G completely however. If 6 is a uniform 
circular cone, then it has full rotational symmetry about the axis connecting its vertex 
to the centre of its base. G must therefore lie somewhere on this axis but there is no 
other symmetry to determine exactly where. In such cases, a summation (or integration) 
becomes neccessary. This process is illustrated by the following two examples. 


Example A.1 Centre of mass of a semi-circular lamina 


Find the position of the centre of mass of a uniform semi-circular lamina. 


Solution 
Suppose that the lamina has radius a and occupies the region A shown in Figure A.1. 
The lamina has rotational symmetry about the y-axis so that G must lie on the y-axis, 
as shown. It remains to find its y-coordinate Y which, from the definition (A.1), is 
given by 
N 
MY =) miyi, (A.2) 
i=l 
where M is the mass of the lamina. However, since the lamina is a continuous distri- 
bution of mass, this sum must now be interpreted as an integral. 
Consider an element of area d A of the lamina; this has mass Md A/(57a”). The 
contribution to the sum in equation (A.2) from this element is therefore 


(MaA/(47a"))y, 
where y is the y-coordinate of the element. On ‘summing’ these contributions (and 
cancelling by M) we find that 


Y= ydA, (A.3) 
A 


qa 
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A 


=e 


where the integral is taken over the region A occupied by the lamina. 

This double integral is most easily evaluated using standard polar coordinates. 
In these coordinates* dA = (dr)(rd0) = rdrd0@, and y = rsin@. The ranges of 
integration forr, 0 areO <r <aand0 < 0 < z. We therefore obtain 


FIGURE A.2 A solid hemisphere of radius a = 4 


occupies the region Y shown. 


Y 


2 
ma? r=0 


2 r= d= 
a (| r ar) (/ sind ao) 
wa r=0 6=0 


_ 2 (13 ey 
= —, (40°) (2) = = 0.424 a. 


r=a 6=n 
/ r? sin @ dr d@ 
6=0 
a 
2 


Hence the centre of mass of the semi-circular lamina is on the y-axis a distance 
4a/3x from the origin. @ 


Example A.2 Centre of mass of a solid hemisphere 


Find the position of the centre of mass of a uniform solid hemisphere. 


Solution 

Suppose that the lamina has radius a and occupies the region VY shown in Figure A.2. 
The hemisphere has full rotational symmetry about the z-axis so that G must lie on 
the z-axis, as shown. It remains to find its z-coordinate Z which, from the definition 
(A.1), is given by 


N 
MZ = yo mizi, (A.4) 


where M is the mass of the hemisphere. However, since the hemisphere is a continu- 
ous distribution of mass, this sum must now be interpreted as an integral. 

Consider a volume element dv of the hemisphere; this has mass Mdvu/ (37a’). 
The contribution to the sum in equation (A.4) from this element is therefore 


(Mav/(3xa’)):, 


* See Figure 3.6 (right) for a diagram of the element of area in plane polars. 
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where z is the z-coordinate of the volume element. On ‘summing’ these contributions 
(and cancelling by M) we find that 


3 
Z = — A. 
sa | 24 (A.5) 


where the integral is taken over the region V occupied by the hemisphere. 

This volume integral is most easily evaluated using standard spherical polar 
coordinates r, 0, @. In these coordinates* dv = (dr)(rd@)(rsinOd¢) = 
r*sin@ dr dO dg, and z = rcos@. The ranges of integration for r, 9 and @ are 
O<r<a,0<0<z7/2and0 < ¢ < 27. We therefore obtain 


3 r=a pO=n/2 po=20 
L= 5 / / i r? sin@ cos 6 dr d0 do 
2ma* Jr—0 Jo=0 o=0 
=d 


3 vB d= /2 go=2n 
=a (/ r3 ar) (/ sin 6 cos) (/ io) 
2ma* \Jr=0 6=0 o=0 
=, (ta') (})(Q2n) = = 
2na2 \4 = 8° 


Hence the centre of mass of the hemisphere is on the z-axis a distance 3a/8 from the 
origin. 


A.2. MOMENT OF INERTIA 


Suppose that we have the same system S as before and that CD is a straight line. 
Then the moment of inertia of S about the axis’ C D is defined as follows: 


Definition A.2 Moment of inertia The moment of inertia of the system S about the 
axis C D is defined by 


Iep = mi pir (A.6) 


where pj; is the perpendicular distance of the mass m; from the axis CD. 


Any system has a moment of inertia about any axis, but here we will suppose that the 
system is a rigid body B. Then the value of J¢p about an embedded axis is a constant 


* See Figure 3.7 (right) for a diagram of the volume element in spherical polars. 
¥ Tt is traditional to call CD an axis whether or not S is a rigid body rotating about CD. 
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and can be calculated once and for all. Calculation of moments of inertia required us to 
evaluate the sum (or integral) appearing in the definition (A.6). This process is illustrated 
by the following examples. 


Example A.3 Uniform rod (about a perpendicular axis through G) 


Find the moment of inertia of a uniform rod of mass M and length 2a about an axis 
through its centre and perpendicular to its length. 


Solution 


Consider an element of length dx of the rod; this has mass Mdx/2a. The contribu- 
tion to the sum in (A.6) from this element is therefore p?(Mdx/2a), where p is the 
distance of the element dx from the specified axis. On ‘summing’ these contributions 
over all the elements of length of the disk, we find that 


I “ a4 aa 2d 
= x => x" dx, 
CD aa P 5) 


- GJ 


since p = |x| for the specified axis. Hence 


M 
leo = 5 (3a?) = Ma’. 
Hence the moment of inertia of the rod about the specified axis is 3M a’. / 


Example A.4 Uniform circular disk (about its axis of symmetry) 


Find the moment of inertia of a uniform circular disk of mass M and radius a about 
its axis of symmetry. 


Solution 


Consider an element of area dA of the disk; this has mass Md A/za?. The contribu- 
tion to the sum in (A.6) from this element is therefore 


(MaA/xa’) p*, 


where p is the distance of the element from the symmetry axis. On ‘summing’ these 
contributions over all the elements of area of the disk we find that 


M 
Iep = —3 / peda, (A.7) 
a> JA 
where the integral is taken over the region A occupied by the disk. 


This double integral is most easily evaluated using standard plane polar coordi- 
nates r, 0. In these coordinates* dA = (dr)(r d0) = r dr d@, and p =r. The ranges 


* See Figure 3.6 (right) for a diagram of the element of area in plane polars. 
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of integration for, 0 areO <r <aand0 <6 < 27. We therefore obtain 


r= 6=2n 
Icp = nt 3 dr do 
d= 


Hence the moment of inertia of the disk about its symmetry axis is 5M a’. / 


Example A.5 Uniform solid sphere (about any axis through G) 


Find the moment of inertia of a uniform solid sphere of mass M and radius a about 
an axis through its centre. 


Solution 


Consider a volume element dv of the sphere; this has mass Mdv/ (37a”). The con- 
tribution to the sum in equation (A.6) from this element is therefore 


(Mav/(4xa°)) p?, 


where p is the distance of the volume element dv from the axis CD. On ‘summing’ 
these contributions over all the volume elements of the sphere, we find that 


Icp = p’ dv, (A.8) 
Yy 


4ra3 
where the integral is taken over the region V occupied by the sphere. 

This volume integral is most easily evaluated using spherical polar coordinates 
r, 0, @ centred at the centre of the sphere, with the line 6 = 0 lying along the axis 
CD. In these coordinates* dv = (dr)(r d@)(r sinOd¢@) = r? sin6 dr d@ dg, and 
p =r sin@. The ranges of integration for’, 9 and @ areO <r <a,0 <0 <a and 
0 < @ < 27. We therefore obtain 


g— sj o= 20 
Icp = ae f. r* sin’ 6 dr dd do 
4ra3 = o= 
3M r=a d= g=2n 
= ( fi r4 ar) ( / sin® 0 / do 
4ra3 \J,-=0 6=0 ¢=0 
3M (1 5\ (4 2 a7 2 


Hence the moment of inertia of the sphere about an axis through its centre is 2M a’, 
a 


* See Figure 3.7 (right) for a diagram of the volume element in spherical polars. 
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TABLE OF MOMENTS OF INERTIA 


Body Axis Moment of inertia 
Thi d o 
in ro . So 
mass M length 2a . Icp = 3Ma 
D 
Circular hoop BX 
ircu 7 ; 
mass M radius a / Icp = Ma 
= 
Girecrdlek eB, 
ircular dis “ ies 
mass M radius a » Icp = 3Ma 
(G . 

Solid sphere A , 
mass M radius a Icp = 3Ma 
Spherical shell ees 
mass M radius a Icp = 3Ma 


Circular cylinder 
mass M radius a 
length 2b 


Icp = 4Ma? + $Mb? 


Table 3. Some useful moments of inertia. All the bodies are uniform and in each case 
the axis C D passes through the centre of mass. 


Table of moments of inertia 


We can now build up a table of the most useful moments of inertia. Table 3 consists of the 
three results derived earlier together with one that is obvious (the circular hoop), and two 
that are new* (the spherical shell and the circular cylinder). 

It may seem that there are some obvious omissions. There is no mention of the circular 
cylinder about its axis of symmetry, nor the rectangular plate about any axis. However, 
the table has been deliberately kept short to make it more memorable. Any other moments 
of inertia that are likely to be required can be quickly deduced from those in the table by 
the methods given below. We begin with a couple of use ful tricks. 


* These two are less common than the others. They can be obtained by integration. 
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Two useful tricks 


The first trick is based on the simple observation that the moment of inertia of a body is 
unaltered if its masses are moved in any manner paraille! to the specified axis. In particular, 
the moment of inertia of a body is unaltered if it is squashed into a /amina perpendicular 
to the specified axis. 

Suppose for example that the body is a uniform circular cylinder of radius a and 
length 2b. Then its moment of inertia about its axis of symmetry is the same as that of a 
uniform circular disk of the same mass and radius about its own axis of symmetry. But this 
moment of inertia is known from Table 3 to be 5M a’. Therefore the moment of inertia 
of the circular cylinder about its axis of symmetry is also 5M a’. As a second example, 
suppose that the body is a uniform rectangular plate occupying the region —a < x <a, 
—b < y < bof the (x, y)-plane. Then the moment of inertia of the plate about the x-axis 
is the same as that of a uniform rod of mass M and length 25 lying along the interval 
—b < y < b of the y-axis. But this moment of inertia is known from the table to be 
3M b?. Therefore the moment of inertia of the rectangular plate about the x-axis is 
also iM b*. Similarly, the moment of inertia of the plate about the y-axis is iM a’. The 
moment of inertia of the plate about the z-axis will be deduced in the next section by using 
the perpendicular axes theorem. 


To illustrate the second trick, suppose a uniform solid sphere of mass M and radius a 
is cut into two hemispheres. Then each of these hemispheres has mass 5M and (from the 


table) has moment of inertia 5 (2m a’) = 3M a” about its axis of symmetry, where M 
is the mass of the whole sphere. Hence, the moment of inertia of a hemisphere of mass 
M and radius a about its axis of symmetry must be 2M a’, which is the same formula* as 
that for the complete sphere! A similar argument applies whenever a body can be cut into 
a number of parts of equal mass that make equal contributions to the moment of inertia. 
The formula for the moment of inertia of each part is the same as that for the whole body. 
Thus the moment of inertia of a segment of a Terry’s chocolate orange about its straight 
edge is 2M a’, where M is the mass of the segment and a is the radius of the orange. 


A.3. PARALLEL AND PERPENDICULAR AXES 


In this final section, we prove two important theorems that enable many more 
moments of inertia to be deduced. These are the parallel axes theorem and the perpen- 
dicular axes theorem. 


Theorem A.1 Parallel axes theorem Let Ig be the moment of inertia of a body about 
some axis through its centre of mass G, and let I be the moment of inertia of the body 
about a parallel axis. Then 


* Tt is the same formula, but remember that M is now the mass of the hemisphere. 
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nm s 
a ~ 
B 
FIGURE A.3 The parallel axes theorem. The 
bold dashed lines are the two parallel axes. 
I =Ig + Ma? (A.9) 


where M is the mass of the body and a is the distance between the two parallel axes. 


This is a powerful result. It means that, if you know the moment of inertia of a body 
about some axis, then you can easily calculate its moment of inertia about any parallel 
axis. The proof is as follows: 


Proof. The configuration is shown in Figure A.3. The axis* {G, n} passes through G and 
{A, n} is the parallel axis. Here n is a unit vector parallel to the two axes, and A is chosen so 


that GA is perpendicular to the two axes. Then if GA represents the vector a, the magnitude 
|a| = a, the distance between the two axes. 

Let P be a typical particle of the body with mass m and position vector r relative to G. 
Construct the perpendiculars PB, PC to the two axes, as shown. Then, by applications of 
Pythagoras, 


PB* = GP? —GB* =|r\* —(r-n)* 
and 


PC? = AP? — AC? =|r—al*- ((r—a)-n) 


= (r—a)+(r—a)—(r-n—a-n) 
= (rl? + |a/* -—2r-a—(r-n)? 
= PB? +a? —2r-a. 


If we now multiply this equality by m and sum over all the particles of the body, we obtain 


N N N 
Il=Igt (om) a — 2) 0 mini -a=1¢+Ma’—2a-) mini 


i=1 i=1 i=l 


= Ig + Ma? —2a-(MR), 


* The notation {G, n} means ‘the axis through G parallel to the vector n’. 
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FIGURE A.4 The perpendicular axes theorem. LE 
The axes OX, OY lie in the plane of the 
lamina and OZ is perpendicular. 


where R is the position vector of G. But since G is itself the origin of position vectors, R = 0 
and we obtain 


I =Ig + Ma’, 


which is the required result. Ml 
Example A.6 Rectangular plate (about an edge) 


A uniform rectangular plate has mass M and sides 2a, 2b. Find its moment of inertia 
about a side of length 2a. 


Solution 
Suppose the plate occupies the region —a < x <a, —b < y < bof the (x, y)-plane. 
It is already known that the moment of inertia of the plate about the x-axis is 3M er, 
The specified axis is parallel to the x-axis, the distance between the two axes being 
b. Hence the required moment of inertia is given by the parallel axes theorem to be 
3Mb? + Mb? = MD". & 

We will do more examples later, but first we will prove the perpendicular axes theo- 

rem. 


Theorem A.2 Perpendicular axes theorem Suppose the body is a lamina and that O 
is some point in its plane. Let OXY Z be a set of Cartesian axes with OX, OY in the plane 
of the lamina and OZ perpendicular (see Figure A.4). Then the moments of inertia of the 
lamina about these three perpendicular axes are related by 


loz =lox +loy (A.10) 


that is, the moment of inertia of the lamina about the axis perpendicular to its plane is the 
sum of the moments of inertia about the two in-plane axes. 


Proof. Let P be a typical particle of the lamina with mass m and coordinates (x, y, 0) in the 
system OXY Z. Then px, py, pz, the perpendicular distances of P from the axes OX, OY 
OZ are given by 


1/2 
px=ly  pr=bh pz = (x? +9") 
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from which it follows that 
Dz = Px + Py- 
If we multiply this equality by m and sum over all the particles of the body, we obtain 
loz =lox +loy, 


which is the required result. Ml 


Example A.7 Rectangular plate (about the perpendicular axis through G) 


A uniform rectangular plate has mass M and sides 2a, 2b. Find its moment of inertia 
about the axis through its centre perpendicular to its plane. 


Solution 


Let the axes OXY Z be placed so that O is at the centre of the plate with OZ perpen- 
dicular to its plane. Then, since the plate is a lamina, it follows from the perpendicular 
axes theorem that 


Toz =lox +loy. 
But it is already known that Jox = 5Mb* and Ioy = 5Ma* so that [oz = 5Mb" + 
5Ma* = 5M (a? + b?). Hence the required moment of inertia is 5M (a? +b). # 
Example A.8 Circular disk (about a diameter) 


Find the moment of inertia of a uniform circular disk of mass M and radius a about 
a diameter. 


Solution 


Let the axes OXY Z be placed so that O is at the centre of the disk with OZ perpen- 
dicular to its plane. Then, since the disk is a lamina, it follows from the perpendicular 
axes theorem that 


Toz =lox + loy. 


But Jgz is known to be 5M az and, by the rotational symmetry of the disk, Jox = 
Toy. It follows that lox = Ioy = iM a”. Hence the moment of inertia of the disk 
about any of its diameters is iM a’. / 


Example A.9 Rectangular block (about any of its edges) 


A uniform rectangular block of mass M has edges of lengths 2a, 2b, 2c. Find its 
moment of inertia about any edge. 


Solution 
Suppose the block occupies the region -a < x <a,-b< y<b,-c <2z<c. 
We will first find its moment of inertia about each of the coordinate axes. Suppose 
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the block is squashed parallel to the x-axis into a lamina of mass M occupying the 
rectangular region —b < y < b, —c < z < c of the (y, z)-plane. The moment of 
inertia of this ‘plate’ about the x-axis is already known to be 4M(b* + c”) and the 
block must have the same moment of inertia about the x-axis. Hence the moments of 
inertia of the block about the x- y- and z-axes are 5M (b? 4s @?), iM (a? +c?) and 
5M (a? + b”) respectively. 

Consider now an axis lying along an edge of length 2a. This axis is parallel to 
the x-axis, the distance between the rwo axes being (b? + c?)!/2, Hence the moment 
of inertia of the block about this edge is given by the theorem of parallel axes to 
be 3M(b* +c¢*)+ M(b? +c?) = $M (b? +c”). Hence the moments of inertia 
of the block about the edges of lengths 2a, 2b, 2c are $M (b? + c?), $M (a? +c), 
iM (a? + b”) respectively. ml 
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Answers to the problems 


Chapter 1 Vectors (page 21) 

1.1 (i) 18i+17j —26k, 6. (ii) 3, 5, cos-'(14/15). (iii) 1/3, —9/5. 

(iv) 4i+2 743k, —20i-13 7 —15k, 9i—12k. 

(v) +3. The set is right-handed. (vi) —11i+ 707 — 46k. 

1.2 cos~!(1/3) 

1.3 c=b-—a, d=-a, e=-b, f=a-b. 

1.14 6i+3° tk, 6i+6¢ j, —6t(t+3)i + 6(¢ +3) f + 12t(t? —2)k. 

1.15 2v-0, (v- kK)u+(v- k)d, [v, 9, k]. 

1.16 ¢=—sindi+cosé j, n= —cosdi—sind j, ck! =a. 

1.17 t = (—asinOi+acosé j +bk) /(a* +.b*)'/?, n = (—cos@i — sind j), e~! = (a7 + 
b*)/a. 

1.18 t= (pi+ f)/(+p*)'?, n= G—pj/A+ p?)'”, «7! = 2a + p?y??. 


Chapter 2 Velocity, acceleration and scalar angular velocity (page 43) 


2.1 v = 12¢ — 3¢2ms—!, a = 12 —6t ms. P comes to rest when¢ = Oandt = 4s. At 
these times its displacement is 1 m and 33 m respectively. 


2.2 v =3t?—4t—15ms—!, x =f3 — 2r? — 15t + 20 m. P comes to rest after 3 s. (The time 
t = —5/3s is before the motion was set up.) At this time P has displacement — 16m. 


2.3 If the acceleration were constant, then, from v = u+at,a = 14.9 fts.*. (Use feet and 
seconds as units.) However, from s = ut + 5at?, a = 20.3fts~?. This indicates that the 
acceleration was not constant. 


2.4 |v| = (27b7 +. c*)!/7, Jal = 27d. 


2.5 0.034ms~? directed towards the centre of the Earth, 0.0060ms~2 directed towards the 
Sun. 


2.6 v = (Qhe)F + (Qhe™) 8, a = (227 be“) 4. 

2.7 cos! (b/(b? + art)!/?), 

2.8 |a| = v"/(bcosa). 

2.9 v = 2bt~72(t — t) F + bt t (2 — t)0. Minimum |v] occurs when t = t. Whent = T, 
a= —3bt~7F. 

2.13 Maximum speed relative to ground is 120 mph. |a| = 121g. 

2.14 |v] = Qb(cosh2Qxr)!/2, a = (222 sinh 21) 6. 


2.15 Maximum speed is we, maximum acceleration is we. 


2.16 wb (1 — (b/c). 
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2.17 Speed of C is (1b + w2b2). Angular velocity of gear is (w@2b2 — w1b1)/(b2 — 51). 
Angular velocity of arm is (1b; + @2b2)/(b1 + b2). 


2.18 The angular velocity @ = —Qbcos Qt (a? — b? sin? ot)”, and the speed of C is 
Qab| cos 2t| (a2 — b sin? wt)”. 
2.19 Bearing is 18.6° west of north, time taken is 3h 10m. 


2.20 aU sina (U? +u2+2Uu cosa) '/”, 


2.21 Maximum range is Ro(v? — u7)!/?/v, achieved when 9 = +7/2. 


Chapter 3  Newton’s laws and gravitation (page 71) 

3.1 /6m?G/a*, V/6m?G/4a?. 

3.2 [V3+ V3/2+ 1/3] mG /a?, 4[1 + 1/V2+1/GV3)] mG /a?. 
3.3 mMG/(x” —a*), (M*G/4a?) In (b*/(b? — 4a*)). 

3.4 (2MM'G/a’b) [a+b — (a* +b*)'/7]. 

3.6 (mMG/a)r. 

3.7 21MoG. 

3.8 3M’G/4a’. 


Chapter 4 Problems in particle dynamics (page 98) 

4.1 57. 37 /2M, 27. 

4.2 (m(b—z)/b+M)g, 3(m(b—z)/b+M)g. 

4.4 (sina — zcosa)g. If > tana, the motion will come to rest and remain at rest. 
4.5 20ms"!. 

4.6 1445m. 


4.7 Terminal velocity is (U 24 V7)!/2. inclined at an angle tan—!(U /V) to the downward 
vertical. 


4.8 tan~!(ebEg/mu7). 

4.9 (2h — gt*)?/8et?. 

4.10 r = [a?/? + 3(MG/2)!/? ¢] 
4.11 /2a?/y!/?. 

4.12 65 days. 


2/3 ‘ oe —— 
/ , which tends to infinity as ¢ tends to infinity. 


4.14 The acceleratons of the three masses are g/9 downwards, 7g /9 upwards and 5g/9 down- 
wards respectively. 


4.15 r =acoshQt, N = 2maQ? sinh Qt. 

4.17 Maximum height is (V*/2g) In[1 + (u?/V7)], time taken is (V/g) tan~!(w/V). 

4.18 Calculated descent times are 2.32 s and 2.27 s, which could hardly have been distinguished. 
4.19 Maximum height is (V7/2g) tan~!(u?/V7). 

4.20 Charge is 6ap(v; + v2)/Eo, where the droplet radius a = 3(uv1/2pg)!/”. 
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4.21 Ranges are 55 m and 57 m respectively. 

4.22 Projection angle is 60°. 

4.23 Man is not safe. At 60m away he would need to be at least 25 m high. 

4.24 Least projection speed to clear hill is (g (a? +h)!" gh) '? Least muzzle speed needed 
to cover the hill is 20ms~!, so this gun can do the job. 


4.25 Landing point has displacement (u7/g) (sin(2aw + 6) — sin B) / cos” B up the plane. RY = 
(u7/g)(1 — sin B)/ cos B, R? = (u*/g)(1 + sin B)/ cos? B. 


4.27 Path is an inclined straight line. 


4.28 Estimated mass of Earth is 6.02 x 107+kg. Calculation supposes the Earth fixed and so 
ignores the motion of the Earth induced by the Moon. Radius of geostationary orbit is approxi- 
mately 42,000 km. 


4.30 Tension is (mg /V2) + (mu? /a), normal reaction is (mg/V2) _ (mu?/a), maximum u 
is (ag//2)'/*. If u exceeds the maximum, the particle loses contact with the sphere. 


4.31 Angle turned by string is u/Kb. Tension is (mu7/b)e~7*'. 


4.32 Particle of mass m meets y-axis again at the point y = —2mV/eBo. 


Chapter 5 Linear oscillations and normal modes (page 126) 
5.1 Amplitude is 2, time taken is 7/6. 

5.2 In parallel, w” = QF + Q3. In series, w? = Q5N5/(QT + 3). 

5.3 Distance travelled is a (1 + e~7*/"). 

5.4 Greater than 8. 

5.5 a= 16.3, B=4.4. 

5.8 Lower block leaves floor after time (27/3) (a/g)!/*. 


1 


5.9 Angular frequencies less than 20 radians s~! and greater than 40 radians s~! are safe. 


5.11 Amplitude is ( 22 + 4K? p? yo 
10 


(Q2 = p*)2 +4K?2p? 


5.12 Response is 
ar 3 (-D"*!\ ( (Q? =n?) sinnt + 2Kn cosnt . 
n (Q?2 — n?)? + 4K?2n2 


5.13 Period is 2(7 + 4)/Q. 
5.14 Body comes to rest at x = +Fo/(2Q7). Time taken is 277/Q. 


5.15 Period is 7(Q + Qp)/(QQp), where Qp = (ox _ K?)1/2, Ratio of sucessive maxima is 
—Kx/Qqp 
e : 


5.16 Slow mode: a, = n//6, A/B = 2/3, where n = (a/m)!/?. Fast mode: w2 = n, 
A/B = —1. General motion is 

xX = 26; cos(wit — y1) + 62 cos(w2t — y2), 

y = 35) cos(@it — y1) — 52 cos(w2t — y2), 
where 61, 52, 71, y2 are arbitrary constants. 
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5.17 Slow mode: @, = n, A/B = 1, wheren = (T/ma)!/?. Fast mode: w2 = J3n, 
A/B = —1. General motion is 
xX = 6; cos(@jt — y,) + 62 cos(w2t — y2), 
y = 6 cos(@jt — y1) — 62 cos(w2t — y2), 
where 61, 52, 71, y2 are arbitrary constants. General motion is not periodic. 
5.18 Slow mode: a; = /2/3n, A/B = 1/2, where n = (g/a)!/*. Fast mode: a, = J2n, 
A/B = —1/2. 
Chapter 6 Energy conservation (page 151) 
6.1 Both are 16J which illustrates the energy priciple. 
6.2 Man works at 500 W, which is quite a feat. 
6.4 3.0 x 10°J. 
6.5 Motion takes place in the interval 3 < x < 6 and the period is 92/2. 


6.7 Particle penetrates a distance (3mu? /2K ) ue 


6.8 V = —2mMG (a? +x?) and vmax = (4MG/5a)!/2, 

6.9 V = (2mMG/a’) [z — (a? + z*)!/*] and speed on impact is (3MG/3a)'/?. 

6.10 Projection speed is (aa*/4m)!/?. 

6.11 Spring is compressed by a/3. 

6.13 Minimum speed needed is (g(a + 2h))!/ 2. When h = 5a the mortar should be placed a 
distance a (v3 - 1) /2 from the wall of the building and inclined at 60° to the horizontal. 


6.14 Escape speed is (ae*/m(b* — a’)) os 


6.15 Escape speed is (a3e?/mb? (b* - a)", 


6.16 (i) is stable while (ii) is unstable. 
6.17 Arrival speed is (4bg)'/2, time taken is (4s (a2 + b?)/gb)'”. 


6.18 When a/b is small, period is approximately 


b\'? a 
In { — | oe eae ea 
(=) ¥ ay 


6.20 Time taken to hit post is [(8/9) + (2/5) In3]a/u. 
6.21 String makes 60° with upward vertical when it first becomes slack. 


6.22 Car was parked about 5 m from edge of building. 


Chapter 7 Orbits in a central field (page 188) 
7.1 Distance of closest approach is (p2V* + y)!/2/V. 
7.2 a=c, b=csina. 

7.4 a=~VJ2c cos(5a), b = V2c sin(5a). 

7.5 120°. 
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7.6 Deflection angle is 2tan~!(MoG/pV7) and distance of closest approach is (p?V* + 
MoG)!/7/V. 


7.7 Path isr = (4p/15)/sin(40/15). Distance of closest approach is 4p/15. 
7.9 Path isr = acos(@/3). Time taken to reach centre is ma? /(/2y). 

7.10 Apsidal angle is 7/(1 — €)!/*. 

7.11 Advance of perihelion is 27 per ‘year’. 

7.12 Advance of perihelion is —47r*e per ‘year’. 

7.13 Advance of perihelion is 67 MG /(ac”) per ‘year’. 

7.14 Differential scattering cross section is a7/4, a constant. 


7.15 Differential scattering cross section is 
my (a — 6) 


6) = 
0) = TagtOn — by sind 


7.16 Period was 89.6 minutes. Maximum speed was 7.84 km per second. 


7.17 Apogee is 3910 km above the Earth. Apogee speed is 5.50 km per second. Period is 128 
minutes. 


7.18 Spacecraft will escape if k > 2. Eccentricity of escape orbit is k? — 1. 
7.19 Spacecraft will go into orbit if 
9 2mG 
< ——_—__.. 
cu? + 2MG 
7.20 Time average of kinetic energy is y/(2a). 
7.22 Av’ = +3.13km per second, Av” = +0.83km per second. Travel time = 119 hours. 
7.23 The thrust should be applied in the direction of motion when the spacecraft is at its apogee. 


7.24 The most fuel efficient one-impulse strategy is to use the first impulse of the Hohmann 
orbit. Av = —2.50km per second for Venus, and 2.95 km per second for Mars. Hence the Venus 
flyby uses less fuel. The travel time is 146 days to Venus, and 259 days to Mars. 


7.25 The most fuel efficient one-impulse strategy is to apply the impulse parallel to the direction 
of motion, Av being chosen to give an orbit with the correct period. For a synchronous orbit, 
Av = 2.94km per second. The apogee is 121,000 km from the Earth’s surface. 

7.26 In the given approximation, r = Cine and 6 = (y?/Lo)et?*'. Hence r6 = 
(y/Lo)et*", which increases as t increases. 

7.27 In the given approximation, r = (Lo/Q)!/2e-*'/? and 6 = Q. Hence rd = 
(QLo)!/*e-'/?, which decreases as t increases. 


Chapter 8 Non-linear oscillations and phase space (page 214) 

8.1. When the oscillation has unit amplitude, @ = 1 — ze +0 (<*) and x(t) = coss + 
5 (coss — cos3s)é +O (e*) where s = wt. 

8.2 When the oscillation has unit amplitude, @ = 1 + xe +O (€*). 

8.3. When the maximum value achieved by x(t) is unity, x(t) = coss + z (-—3+ 


2coss +cos2s)é+O (<*), where s = (1 + O(e*)) t. The minimum value achieved by x(t) is 
-1-—4€+0 (ce). 
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8.4 At zero order, the limit cycle is a circle centre the origin and radius 2. Correct to order €7, 
o=1- Ke +0 (<3), and correct to order €, x(t) = 2coss + 5 (—3 sins + sin3s) +O (€’). 


8.9 The period of the limit cycle is 27. 
8.13 Correct to order €, the driven response is 


cos pt 3 cos pt cos 3 pt 2 
x(t) = O 
eee ge eee) 


which is valid when p # 1, 1/3, 1/5,.... 


Chapter9 Theenergy principle (page 241) 

9.1 Equilibrium position is 9 = tan~!(m/M). 

9.2 The equilibrium positions are (i) vertically downwards (unstable), and (ii) inclined at 60° to 
the upward vertical (stable). 

1/2 


9.3 Angular frequency is (4Vo / mb?) 
9.5 Acceleration of mass m up the plane, and the string tension, are 


M sina — m sin B d Mmg (si +sin 8) 
an sina + sin Bf). 
M+m e M+m 


9.6 Normal reactions on P and Q are 


img (7 cos@ + 2sin@ — 32) and zmg (4cos¢ + 5sind — 32) : 


9.7 Speed of rope on leaving peg is (ag)!/? 


9.8 The velocity and acceleration of the free end are given by 


2 x(2a — x) dv _ Qa? = 2ax x 
= a-—Xx ms dt (a — x)? me 


and the free end has acceleration 5g when x = $a. 


9.9 Amplitude of oscillations is a, period is 4 (a/g)!/”. 
9.10 Final speed of hoop is (v? + gh)". 
9.11 Acceleration of ball is 2¢ sing. 
9.12 Acceleration of yo-yo is 5 g. 
9.13 Speed of roll when its radius is b is 
(Ss 49 (b> — ”) - 
b? 3b 


9.14 Period of small oscillations is 
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— 1/2 
9.15 The speed of G is (Pea +b) - cos 6)) : 


9.16 Period of small oscillations is 27 (7(b — a)/5g)'/?. 


9.17 Period of small oscillations is 27 (b?/3ga) ne 


Chapter 10 The linear momentum principle (page 279) 
10.1 The time average of the apparent weight is Mg. 


10.2 The time average of the total force applied by the juggler is 10Mg. Juggling the balls as 
he crosses the bridge is worse than carrying them. The average force his feet apply to the bridge 
is 10M g and, since this is certainly not a constant, there will be times when the force exceeds 
10Mg. 


10.4 The reaction exerted by the support is 
_ MG (= + 2ax — =) 


R 
4a 


a—xX 


The support will give way when x = da. 


10.5 The speed of the vertical part of the chain when it has been pulled up a distance x is given 


by v? = (a — 3x). 


10.6 The speed of G is given by v* = 2 e(a +b). — cos @). 


10.8 The speed and height at burnout are 2100 ms~! and 100km. 


10.9 The maximum speed achieved is 
“(l-y7’) ~ ulny [! —Lany)e+0 (<)| 
€ 


when € is small; y = M/m. 

10.13 Velocity of composite particle is (m,v, + m2v2)/(m, + mz). 

10.15 The proton lost 14% of its energy and the recoil angle of the heium numcleus was 55°. 
10.16 The mystery nucleus has atomic mass 16 and is therefore oxygen. 


10.18 Neutrons of energy zE will be found at angles of 30° and 60° to the incident beam. 
Neutrons of energy zE will also be found at these angles. 


10.20 The dark companion in Cygnus X-1 has a mass of about 16Mo. 


10.21 The emerging particles are scattered alpha particles and recoiling helium nucleii, which 
are identical. The angular distribution of emerging particles is 


q* ( cosé 1 
o\ an ona yt 
E sin’ 0 cos’ 0 


where @ is measured from the direction of the incident beam. 


10.22 The scattering cross section for the incident neutrons is 


A 
of BL cos 6), 
a 
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and the angular distribution of the recoiling protons is the same, that is, (A/zr) cos 62. 


10.24 The rod will hit the table after a time 


1/2 px/2 (4 — 29\1/2 
(<) i 3 cos* 0 ‘ie 
3g 1/3 1 —2cosé 
Chapter 11 The angular momentum principle (page 317) 
11.2 Angular speed of target after impact is 


4mb 
—_——__ } n. 
Maz + 4mb?2 


11.3 Angular speed of cylinder after impact is 


( 2mb ) 
a2(M +2m)) 


11.4 Angular speed of disk is z(a? /b*)Q and increase in kinetic energy is 


Ma? 


Gal = Sp 70". 


11.5 Angle of the new conical motion is 84° approximately. 


11.6 In the elastic case, the speed of the ball after impact is 
f=, m b? 
—]}u, where B=—j{1+-—]. 
1+ 6 M k2 


1/2 tan~! (ut 
r= (a? +0?) ee 
sing 


11.8 In Case A, 


1/2 
In Case B, the required value of u is (Jas) F 


11.9 After one lap by the bug, the hoop has rotated through the angle 


2m 
—_____. } 37, 
M+2m 

11.10 Period of small oscillations is 
1/2 
a? + 3b" : 
2x | ———— ; 
3gb 


11.11 Speed of ball after the onset of rolling is 3V and : of the kinetic energy is lost in the 
process. 
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11.12 Ball will slide if tana > i wand will roll if tana < i ww. If ball slides, the acceleration is 
g(sina — jxcos a). If ball rolls, the acceleration is 3 gsing. 


11.13 Bug will reach top of disk if 


2 8mag 
M + 2m" 


11.14 Acceleration of yo-yo is 3 g- 5Z downwards. 


11.15 Forward velocity of cylinder is 


11.16 
S = ty Gy —2)Mg sind, K =5y7*(1—y)Mgasiné. 


11.17 Reactions at B and C are (dmg) i and (img) j. Reaction at the floor is 
zmg (-i-jt+hk). 


Chapter 12 Lagrange’s equations and conservation principles (page 361) 
12.1 Closed chain has N degrees of freedom. There are four conserved quantities (two hori- 


zontal components of linear momentum, the vertical component of angular momentum about G, 
and kinetic energy). Motion can be determined from conservation principles if N < 4. 


12.2 Upward acceleration of mass m is 22/5. 


12.3 The accelerations of the three masses are (in order) g/9 downwards, 7g/9 upwards and 
5g/9 downwards. 


12.4 Period of small oscillations is 21 (4a/(3g sina))!/?. 


12.5 Normal reaction is mg(7 cos 6 — 4)/3. It is not realistic to assume that rolling persists until 
the normal reaction becomes zero. This would require an infinite coefficient of friction! 


12.6 p, = (3/2)Mx+mx+b 6 cos @), which is not the horizontal linear momentum. Period 
of small oscillations is 27 (3Mb/(3M + 2m)g)!/?. 


12.7 Accelerations are (i) 2g sina cosa/(7 — 2 cos” a), (ii) Sg sina/(7 — 2 cos? @). 
12.8 Reaction of the floor is mg (4 — 6 cosa cos @ + 3cos? 4) / (1+ 3 sin? 0). 

12.9 po = mR76, which is the vertical component of angular momentum about O. Kinetic 
energy is not conserved because the force pulling the string down does work. Tension in the 
string is m ((L7/R?) - R). 

12.10 Solution is = a cosh Qr. Energy function h = sma? (7? - Orr} = —5ma?Q?, which 
is constant. 


12.11 Downward acceleration of yo-yo is (2g — Z) /3, SO yo-yo can remain at same height (or 
move with constant velocity) if Z = 2g. The total energy at time f is M (Z* + 2gZ) /6. 


12.12 Neither of E and h is conserved in general. (Consider, for example, the case in which Z 
is constant.) 
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12.13 Angle turned by hoop is 27m /(M + 2m). 
12.14 U=-—f(t)x. 
12.15 The conserved momenta are py = mx + etz and py = my. 


12.17 Conserved momentum pg = mr?6, which is the component of angular momentum about 
O perpendicular to the plane of motion. 


2 


12.18 Conserved momentum pg = mr sin? a @, which is the angular momentum about the 


axis of the cone. 
12.19 Conserved momenta are pp = mr76 and p; = mz — (euol /27) Inr. 


12.20 In terms of obvious coordinates, the conserved quantities are (i) the vector L (ii) P,, Py 
and L, (iii) L, (iv) L, (v) P; and L,. 


12.21 Conserved quantity is L, + cP. 


Chapter 13 Calculus of variations and Hamilton’s principle (page 388) 
13.4 x =71442. 

13.2 x =sinr. 

13.4 


13.5 

acos (57 sina) 
Qi a 
cos (9 sina) 
13.6 x =71(2r+ X — 8) /4. 


Chapter 14 Hamilton’s equations and phase space (page 413) 

14.1 G = —v} + 3v;v2 — 205 + 6wr) — 9wv2 — 9w”. 

14.2 The Hamiltonian is H = pe /(2m(a* + b*)) + mgb@ and the equations are 
6 = po/(m(a? +b?)), po = —mgb. 

14.3 The Hamiltonian is H = (p? + p?)/2m + mgz and the equations are 

x = p,/m, Z7=p:z/m, py =0, pz = —mg. The coordinate x is cyclic. 


14.4 The Hamiltonian is 


_ Pb Po 
Gag Sd ae mga cos 6 
and the equations are 
; Po : Po : P5 cos@ 
ear Om tty =. ite bo = 0. 


14.5 The Hamiltonian is 


: a* p> — 2a cos Opy po + 2p}, ‘nines 
2ma2 (2 — cos? 0) . , 
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14.6 The Hamiltonian is 


2 
Po 1, 52 
— 5mZ Z)cos@ 
mean eo ee 


and the equations are 6 = pg/m(a — Z)*, po = —mg(a — Z)sin@. Since H has an explicit 
time dependence through Z(r), it is not conserved. 


14.7 The Hamiltonian H is conserved when the fields {E, B} are static. 


Chapter 15 General theory of small oscillations (page 452) 
15.1 w) =a/6m, w; = 3a/4m, a = (2,3), a2 = (4, —1). In the subsequent motion, 


2u F : 
x (@2 Sin@,t + 6@)sinw2t), y= 
2 


Uu . 3 
= —— ——— (2 sinw;t — w; sinwoft). 
14a@,0 w2 


14a 


15.2 w, =a/6m, w; = a/m, a, = (2,3), a2 = (1,-1). The variables yn} = x + y, 
n2 = 3x — 2y are a set of normal coordinates. 

15.3 wi = g/2a, w5 = 15¢/2a, a; = (5,6), az = (3, —2). The ratio t/t. = 15, which 
is irrational; so general small motion is not periodic. 

15.4 wt = g/2a, wy = 3g/2a, w% = 3g/a, a; = (1,2,3), a = (1,0,-3), a3 = 
(1, —3,3). The variables n} = 30 + 26+ W, n2 = 30 — Ww, 73 = 30 — 36+ W are a set of 
normal coordinates. 


15.5 The longitudinal modes have frequencies or = a/m and ws = 3a/m, and the transverse 


modes have frequencies w3 = To/ma and wy = 3T9/ma. 

15.6 w, = 49/9a, w; = g/a, wy = 4g/a. Then %)/t, = 3/2 and t/t; = 3 which are both 
rational. The general small motion is periodic with period 62 (a/g)!/?. 

15.7 The longitudinal mode has frequency or = 11g/48a, and the transverse modes have 
frequencies ws = g/6a and W3 = 3¢/4a. 

15.8 The transverse modes have frequencies ws = 12T9/5Ma and 3 = 187T9/Ma. 


15.9 The j-th normal frequency is 


a) i jn (1 <j <n) 
oj =2(— sin | ————_ n). 
4 m 2(n + 1) = i= 
15.10 The j-th normal frequency is 
Ty \ 1/2 ; 
6=9( =") sin = — (<j<n). 
, ma 2(n + 1) 


15.11 Vibrational frequencies are or =a/m, ws = 6a/m. Estimated frequency ratio is 2.54, 
remarkably close to the measured value of 2.49. 


15.12 The antisymmetric mode has frequency W3 = 7k/4m, and the symmetric modes have 
frequencies or = k/4m and ws = 2k/m. 
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Chapter 16 Vector angular velocity and rigid body kinematics (page 467) 


16.1 Angular velocity is 2k radians per second, particle velocity is 6i + 8jms~!, speed is 
10ms~!, and acceleration is —16i + 12 j ms~?. 


16.2 Angular velocity is 2i+2j+k radians per second, particle velocity is 3i—6j+6kcms~!, 
speed is 9cms~!, and acceleration is 18i — 9j — 18kcms~?. 


16.4 Angular velocity of penny is 6 (k —cosaa). Velocity of highest particle is zero. 
16.5 Highest particle has the greatest speed which is 2h cos? a|6]. 


Chapter 17 Rotating reference frames (page 489) 
17.3 Angular speed is A seca. 
17.4 r =acoshQ. 


17.5 Take the angular velocity of the rotating frame to be & = —(eB/2mc)k. The two frequen- 
cies are wo + (eB/2mc). 


17.10 r =acosh (3.2). 


17.11 Speed of cylinder is Qx //2. Turntable exerts force — 5MQ?x i+ J/2MQ?x j+Mgk 
on the cylinder, where i points in the direction of motion and k points vertically upwards. 


17.12 In cylindrical coordinates with Oz along the rotation axis of the bucket, the isobars are 
the family of surfaces z = (Q7/2g)r? + c, where c is a constant. The free surface of the water 
must be one of these, whatever the shape of the container. 


17.13 In cylindrical coordinates, the water occupies the region 5a <r <a. The pressure at the 
wall is po + 32a". 

Chapter 18 Tensor algebra and the inertia tensor (page 519) 

18.1 C and C’ have opposite handedness. 


18.2 The transformation matrices are 


1 0-1 100 1 (4 7-4 cf 148 
lo o) Ge lost ol) Gali 4 6) Geel 47-4 
V2\1 0 1 001 9\g-4 4 9\ 3 4 1 


The new coordinates of the point D are (3//2, —3, 3/4/2), (3, 3, 0), (-1,-1, 4), 
(—1, —1, —4) respectively. 


18.3 Solution for v is (1, 1, 0)’ or any multiple of it. The rotation angle about OE is 7/3. 
18.4 The transformation formula for a fifth order tensor is 


3.3 «3 


2 3 3 2 3 
/ 
lijkim _ >. a. > > s ip Q jg Akr Als Amt tparst 


poalgeal ral sa! fe1 


18.5 The matrices representing the tensor are 


i J21 1 10-1 
(i) |] 1V¥2 0 (ii) 01 0 
v2 1 072 =1o0 4 
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18.6 (i) Yes, order 7, (ii) Yes, order 6, (iii) Yes, order 1 (a vector), (iv) Yes, order 1 (a vector), 
(v) No, (vi) Yes, order 5, (vii) Yes, order zero (a scalar), (viii) No, (ix) Yes, order zero (a 
scalar). 


18.9 The most general material has 21 elastic moduli. 

18.10 Moment of inertia of the plate about a diagonal is 3M a’b* /(a* + b*). 

18.11 Principal moments of inertia of the disk are (i) iM’, 4Ma’, 1Ma?, (ii) Ma’, 
iM ae 3M a 

18.12 Principal moments of inertia of the top at A are Ma’, Ma’, 5Ma’. 


18.13 Principal moments of inertia of the top at A are Ma’, saMa’, 2 Ma’. 


18.14 Principal moments of inertia of the cube are (i) $Ma’, $Ma’, $Ma? (ii) Ma’, 
3 Ma’, 3Ma* (iii) Ma’, Ma’, 3 Ma’. The required moments of inertia are (i) $ Ma’, 
(ii) 3 Ma’, (iii) $Ma?. 

18.15 Principal moments of inertia are (i) }M(b? +c”), $M(a2 +c), 3M(a? +b’) (ii) 
3M (b* + 4c*), 3M (a? + 4c?), 3M (a? + b*). The required moments of inertia are 


2M (a*b? + a2c? + b*c”) (ii 2M (a7b? + 2a*c? + 2b?c?) 
11 
3(a2 + b2 +c?) 3(a2 + b2) 


(i) 


18.16 Principal moments of inertia at G are ;Ma?+ ;Mb?, ;Ma?+ 3Mb?, 5Ma’. Cylinder 
is dynamically spherical when b = /3a/2. 


18.17 Symmetries are (i) axial, (ii) none, (iii) none, (iv) axial, (v) axial, (vi) spherical, (vii) 
axial, (viii) axial, (ix) spherical. 


18.18 Principal moments of inertia at a corner point of the plate are £(5 + 3./2)Ma?, £(5 - 


3./2)Ma’, 2M a’. A set of principal axes is obtained by rotating the axes Cx1x2.x3 shown in 
Figure 18.3 through an angle of —z/8 about the axis Cx3. 


Chapter 19 Problems in rigid body dynamics (page 560) 

19.2 Radius of circle is tb and centre is a distance 3b from O. 

19.3 Ball is stable at highest point if n? > 35(a + b)g/a’. 

19.4 Precession rate is Mgh/Cn. 

19.6 For upright stability, the spin would have to be at least 3860 revolutions per second! 


19.7 Spin nedded for stability is (i) 9.3 and (ii) 6.2 revolutions per second. The juggler should 
use a hollow ball. 


19.11 The required moment is 5M a?Qn about the horizontal axis perpendicular to the symme- 
try axis of the wheel. 


19.14 The motion of the axis is approximately periodic with period 7/A. 
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acceleration bicycle 

in polar coordinates, 34 stability of, 559 

rectilinear, 26 binary star, 268 

vector, 28 period of, 267 
action functional, 384 brachistochrone problem, 378 
angular momentum, 289-306 Brahe, Tycho, 155 


about a point, 290 
about an axis, 291 


conservation of, 298-306 calculus of variations, 369-380 
about a point, 298 with many variables, 385 
about an axis, 299 central field, 157 
in planar motion, 307 angular momentum conservation in, 158 
of a rigid body, 292-293 whose orbits are all closed, 168 
about its rotation axis, 293 centre of mass, 58, 564, 564-567 
in general motion, 510 equation, 247 
angular momentum principle, 294-297 finding, by integration, 565—567 
about a fixed point, 295 finding, by symmetry, 564 
about the centre of mass, 296 motion of, 248, 247-250 
non-standard, 317 of a hemisphere, 566 
angular velocity of a semi-circular lamina, 565 
scalar, 37, 36-40 centrifugal force, 477 
vector, 457, 457-464 chaos 
existence of, 46/463 in Duffing’s equation, 213, 218 
transformation of, 508 not in integrable Hamiltonian systems, 417 
apse, 162 not in plane autonomous systems, 210 
apsidal angle, 162 charged particle 
apsidal distance, 162, 163 Hamiltonian of, 414 
astronomical units, 174 in a magnetic field, 96 
autonomous system(s), 200, 353, 404 Lagrangian of, 351, 363 
finding phase paths of, 200 circular motion, 92—98 
phase paths of, 408 non-uniform, 35 
properties of, 20/202 uniform, 30 
axis collision process, 255-264 
notation for, 572 elastic, 257, 257-259 


elastic collision formulae, 263 
energy conserved in, 258 


basis set, 6-8 energy principle for, 257 
handedness of, 7 in the ZM frame, 259-26] 
orthonormal, 7 linear momentum conserved in, 257 
standard, 8 meaning of, 255 


Bertrand’s theorem, 169 collision theory, see collision process 
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configuration 
of a system, 221, 325 
configuration space, 330-333 
conics 
geometry of, 184-186 
conservation principles 
and symmetry, 356-361 
constraint force(s), 78, 325 
that do no work, 146, 228 
constraint(s), 324-325 
examples of, 324 
geometrical, 324 
integrable, 328 
kinematical, 324, 328-330 
moving, 344-348 
examples of, 344 
Lagrange’s equations for, 345 
non-integrable, 329 
Coriolis force, 477 
Coriolis, Gaspard-Gustav, 475 
couple, 287 
cyclic coordinate, 355, 406 


d’Alembert’s principle, 335, 333-335 
d'Alembert, Jean le Rond, 334 
degrees of freedom, 222, 327, 
327-328 
directed line segment, 4 
drag, see resistance 
drag coefficient, 83 
Duffing’s equation 
driven, 211 
chaotic solutions of, 218 
sub-harmonic resonance of, 213 
undriven, 195 
dynamical balancing, 548 
dynamical symmetry, 5/6-5/8 
axial, 517 
none, 517 
spherical, 518 


elastic collision formulae, 263 
embedded frame, see Euler’s equations 
energy conservation 
for a constrained system, 229, 
228-233 
for a particle 
in 3-D motion, 143 
in a conservative field, /40-—/43 


in constrained motion, 146, /45—/46 
in rectilinear motion, 134, /33—/34 
for an unconstrained system, 227, 225-228 


energy function, 352 
conservation of, 353 
for a conservative system, 354 
energy principle 
for a particle, 132 
for a system, 224, 225-224 


equilibrium 
of a conservative system, 42/424 
condition for, 422 
stable, 423 
condition for, 423 
equilibrium position 
of a particle, /38—/39 
oscillations about, 139 
stability of, 139 
equivalence, principle of, 68 
escape speed, 78 
Euler’s angles, 535 
Euler’s equations, 544-549 
deficiences of, 548 
Euler—Lagrange equation(s), 374 
integral of, 377 
with many variables, 386 
extremals, 374 


Fermat’s principle, 381, 380-382 
field, see force, field of 
force 
definition of, 56 
external, 223 
fictitious, 477-478 
centrifugal, 477 
Coriolis, 477 
field of, 74 
conservative, 141 
internal, 223 
conservative, 226 
units of, 51 
Foucault pendulum, 482-484 
Foucault, Jean Bernard Leon, 484 
Fourier series, 117 
Fourier’s theorem, 117 
Fourier, Jean Baptiste Joseph, 117 
frames 
in relative motion, 40 
inertial, 53, 52-54 
mutually unaccelerated, 43 
rotating, see rotating frame(s) 
frisbee 
wobbling, 530 
functional 
extremal of, 374 
local minimum of, 372 
made stationary, 372, 385 
meaning of, 369 


generalised coordinates, 222, 326, 325-330 


generalised eigenvalues, 433 
of a symmetric matrix, 433 

generalised eigenvectors, 433 
orthogonality of, 444 

generalised force, 337 

generalised momenta, 354, 354-356 
conservation of, 355, 406 
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Index 


generalised velocities, 331, 330-332 
geodesic, 367 
grad operator 
n-dimensional, 399 
3-dimensional, 141 
gravitation 
by a distribution of mass, 60-67 
by spheres, 64-67 
constant of, 60 
law of, 59 
gravitational acceleration, 68 
gyrocompass, 54/544 


Hamilton form, 395, 396 
Hamilton’s equations, 402, 400-403 
in Poisson bracket form, 417 
Hamilton’s principle, 386, 383-388 
significance of, 387 
with one degree of freedom, 383 
Hamilton, Sir William Rowan, 367, 
402 
Hamiltonian function, 401 
conservation of, 404 
equation defining, 401 
explicit time dependence in, 402 
for a conservative system, 404 
Hohmann transfer, 177-178 
optimality of, 186-188 
holonomic system, 330 
Hooke’s law, 106 
Hooke, Robert, 106 


inertia tensor, 510, 508-516 
elements of, 511 
principal axes of, 515, 5/4-5/6 
principal moments in, 515 
inertia, law of, 53 
inertial frame, see frame, inertial 
integrable system(s), 302-306 
integrable systems, 273, 273-277 
invariance 
of potential energy 
under rotation, 358 
under translation, 356 
of the Lagrangian, 359 


inverse square field (attractive), 170-178 


E-formula for, 172 
Kepler’s equation for, 176 
L-formula for, 171 
orbits in, 170 
period formula for, 174 
period of elliptic orbit in, 173 
time dependence of motion in, 175 
inverse square field (repulsive), 179 
E-formula for, 179 
L-formula for, 179 
orbits in, 179 
isolated system, 250, 298 


Kepler’s laws, 156 
Kepler, Johannes, 155 
kinetic energy, 132 
of a rigid body 
in general motion, 238, 237-238 
in general rotational motion, 510 
with a fixed axis, 235, 234-235 


laboratory (lab) frame, 259 
Lagrange’s equations, 335-343 
for a conservative system, 339 
for a general system, 338 
in Lagrangian form, 349 
partial derivatives in, 336 
sufficiency of, 339 
summary of method, 339 
with moving constraints, 346 
Lagrange, Joseph-Louis, 323 
Lagrangian, 349, 348-351 
invariance of, 359 
Legendre transform(s), 400, 396-400 
active and passive variables in, 399 
with many variables, 399 
with two variables, 396 
limit cycle(s), 205-210 
mechanical example of, 209 
not in Hamiltonian dynamics, 
410 
of Rayleigh’s equation, 207 
of Van der Pol’s equation, 217 
proving existence of, 206-209 
line integral, 140 
linear momentum 
component, conservation of, 251 
conservation of, 250, 250-251 
of a particle, 245 
of a system, 245 
linear momentum principle, 247 
Linstedt’s method, 197-199 
Liouville’s theorem, 409 
on integrable systems, 417 
use in statistical mechanics, 415 
Liouville, Joseph, 409, 417 
Littlewood, John Edensor, 478 
Lorentz force, 350 


mass 
gravitational, 67 
inertial, 55 
units of, 51 
Maxwell’s equations 
in dynamic case, 363 
in static case, 350 
molecular vibrations, 435-440 
moment of a force, 286-289 
about a point, 286 
about an axis, 288 
geometrical meaning of, 287 
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moment of inertia, 234, 567, 567-575 
of a circular disk, 568, 570, 574 
of a cylinder, 570 
of a hoop, 570 
of a rectangular block, 574 
of a rectangular plate, 573, 574 
of a rod, 568, 570 
of a solid sphere, 569, 570 
of a spherical shell, 570 
parallel axes theorem for, 571 
perpendicular axes theorem for, 573 
table of, 570 
momentum, see linear momentum 
or angular momentum 
or generalised momentum 
multiple interactions, law of, 57 
mutual interaction, law of, 54 


Newton, Sir Isaac, 50 
Newton’s Laws, 51 
experimental basis of, 58 
Noether’s theorem, 359, 358-361 
Noether, Emmy Amalie, 358 
non-linear oscillations 
driven, 2//—214 
free periodic, 194-199 
Linstedt’s method, 197-199 
proving existence of, 195 
unsymmetrical, 215 
normal coordinates, 448, 448-452 
determination of, 451 
normal frequency, 123, 431 
degenerate, 435 
determinantal equation for, 430 
normal mode(s), 122, /2/—/26, 429 
amplitude vector of, 430 
existence of, 433-435, 435 
orthogonality of, 444-445 
orthogonality relations for, 445 
orthonormality relations for, 445 
Rayleigh’s principle for, 445 
T -matrix for, 428 
typical problems on, 436-444 
V -matrix for, 428 
normal vector (to a curve), 20-22 
nutation, 538 


ODEs 
first order systems of, 200, 393 
one-body problem, 157 
orbit 
aphelion of, 162 
apogee of, 162 
perigee of, 162 
perihelion of, 162 
orbital motion 
angular momentum equation for, 159 
effective potential in, 161 
energy equation for, 160 
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general nature of, 159-164 

nearly circular, 167-169 

path equation for, 165, 164-167 

radial motion equation for, 160 

stability of, 168 
orthogonal diagonalisation 

of a symmetric matrix, 514 
orthogonal matrix, 494 
orthogonal transformation, 495, 

493-499 

determinant of, 499 
oscillations (linear) 

coupled, /2/—126 

driven, 107 

free, 107 


particle, 51 
perturbation expansion 
Linstedt’s method, 197-199 
regular, 196 
phase diagram, 200 
for damped SHO, 203 
phase fluid, 408 
phase path(s), 200 
closed, 202 
in polar form, 215 
phase space, 199-21] 
(q, p)-space, 406-408 
(x, v)—plane, 202 
(x1, X2)—plane, 200 
planar motion, 306, 306-313 
of a rigid body, 308 
rigid body equations for, 308 
Poincaré—Bendixson theorem, 205 
applications of, 206-209, 217 
Poisson brackets, 416 
Jacobi’s identity for, 416 
polar coordinates, 32 
polar unit vectors, 32 
pool ball, see snooker ball 
position vector, 8 
potential 
electrostatic, 350 
magnetic vector, 350 
velocity dependent, 349-35] 
potential energy 
mutual, 226 
of a particle 
in 3-D motion, 141 
in rectilinear motion, 134 
of a system 
external, 225 
internal, 225-227 
total, 227 
under uniform gravity, 225 
precession 
of a top, 532-534, 539-541 
of planetary orbits, 169 
predator-prey equations, 200, 394 
periodicity of solutions, 216 
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product(s) of inertia, 511 
products of vectors, see vectors 
projectile motion, 88-92 

with resistance, 90-92 
pseudovector(s), 508 


recurrence theorem, 411 
stronger form of, 412 
reduced mass, 266 
reference frame, see frame 
resistance 
linear law of, 83 
physics of, 82 
quadratic law of, 83 
resisted motion, 82-87 
resonance, 116, //5—116 
sub-harmonic, 213 
super-harmonic, 213, 217 
resonant frequency, 116 
response 
driven, 113 
transient, 113 
Reynolds number, 83 
rigid body (dynamics of) 
axially symmetric, 527-544 
under no forces, 530, 527-531 
equations of motion for 
in general motion, 522 
with a fixed point, 523 
in planar motion, 306-313 
spherically symmetric, 524-527 
unsymmetrical, 549-556 
free motion of, 55/—556 
stability of rotation of, 553 
rigid body (kinematics of) 
in general motion, 460-467 
angular velocity of, 461, 462 
planar motion of, 38, 38-40 
with a fixed axis, 36, 457-460 
particle velocities in, 459 
rigid body equations, 297 
rigidity, 457 
conditions for, 460 
rocket equation 
in free space, 253 
under gravity, 254 
rocket motion, 25/—255 
rolling condition, 40, 325, 464 
rolling wheel, 556-559 
stability of, 558 
ropes 
discrete representation of, 231 
Rossby number, 484 
Rossby, Carl-Gustav, 484 
rotating Earth, 478-485 
apparent value of g on, 479 
apparent vertical on, 479 
geostrophic flow on, 485 
pendulum on, 482-484 
projectile equation for, 480 


projectile motion on, 480 
wind on, 454-485 
rotating frame(s) 
apparent acceleration in, 475 
apparent angular velocity in, 474 
apparent velocity in, 473 
basis vectors in, 470 
true rates of change of, 472 
energy conservation in, 487 
energy principle in, 486-488 
form of Second Law in, 476 
linear momentum principle in, 485 
true and apparent values in, 471 
Rutherford, Sir Ernest, 179 
Rutherford scattering, 179-184 
one-body formula, 184 
two-body formula, 272 


SI units, 51 
scalar (invariant), 500 
scattering cross section, 181 
formula in axisymmetric case, 183 
general relation satisfied by, 182 
seismograph, 120 
SHM, see simple harmonic motion 
SHM equation, 107 
with damping, 109 
with damping and driven, 112 
simple harmonic motion 
driven, //2—/19 
driven by harmonic force, //2—/15 
driven by periodic force, //7—119 
free, critically damped, 127 
free, damped, /09-/11 
free, overdamped, /// 
free, undamped, 107-109 
free, underdamped, //0-111 
simple pendulum, 93-96 
exact equation for, 94 
exact period of, 95 
linear theory of, 94 
Snell’s law, 380 
snooker ball 
dynamics of, 525-527 
putting ‘side’ on, 465 
rolling of, 464 
speed, 29 
spherical pendulum, 302 
spinning top 
Lagrangian for, 536 
solved analytically, 535-541 
solved vectorially, 53/534 
spring 
linear, 106 
spring constant (strength) of, 106 
non-linear 
hardening, 194 
softening, 194 
restoring force of, 106 
standard mechanical system, 335 


595 


596 


statics 

of rigid bodies, 3/3-316 

equations for, 314 

Stokes drag formula, 83 
Stokes, George Gabriel, 83 
string 

inextensible, 80 

tension in, 80 
summation convention, 502 
symmetry 

and conservation principles, 356-361 
system(s) 

constrained, 221 

unconstrained, 221 


T -matrix, 428 
tangent vector (to a curve), 15-20 
tensor(s) 
contraction of, 506 
identity, 505 
isotropic, 505 
outer product of, 505 
second order, 501 
in matrix form, 503 
summation convention for, 502 
third order, 502 
vector regarded as, 501 
three-body problem, 417 
top, see spinning top 
torque, 286 
transformation matrix, 494 
of a given reflection, 498-499 
of a given rotation, 495-498 
two-body problem, 264-269 
reduced mass in, 266 
relative motion equation, 266 
relative motion in, 266 
two-body scattering, 269-273 
cross section formula for, 271 
in Rutherford case, 272 
in Rutherford case, 270 
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V -matrix, 428 

variation, 371 
admissible, 371 
norm of, 371 

variational principles, 380-388 

vector algebra, 3-16 

vector calculus, /6-/8 

vector geometry, 8—10 

vector quantity, 3 

vector(s), 4 
as first order tensors, 501 
basis set of, 6—S 
components of, 12, /2—13 
difference of, 5 
differentiation of, 16, /6-/8 
scalar multiple of, 5, 5—6 
scalar product of, 10, /0-/2 
sum of, 5, 5 
triple scalar product of, 15 
triple vector product of, /5—/6 
unit, 6 
vector product of, 13, 13-14 

velocity 
circumferential, 35 
in polar coordinates, 34 
rectilinear, 26 
vector, 28 

velocity diagram(s), 273, 

273-274 

virtual motion, 334 

virtual work, 334 
principle of, 335 


weight, 69 
work 
done by a force, 132, 133, 
141 
SI unit of, 132 


zero momentum (ZM) frame, 260 


